Using Spousal Surnames and DNA to Unravel Male Lines

Posted on August 10, 2017 by Roberta Estes

When Y DNA matching at Family Tree DNA, it’s not uncommon for men to match other males of the same surname who share the same ancestor. In fact, that’s what we hope for, fervently!

However, if you’re stuck downstream, you may need to figure out which of several male children you descend from.

If you’re staring at a brick wall working yourselves back in time, you may need to try working forward, utilizing various types of information, including wives’ surnames.

For all intents and purposes, this is my Vannoy line, in Wilkes County, NC, so let’s use it as an example, because it embodies both the promise and the peril of this approach.

So, there you sit, disconnected from the Vannoy line. That little yellow box is just so depressing. So close, but yet so far. And yes, we’ve already exhausted the available paper trail records, years ago.

We know the lineage back through Elijah Vannoy, who was born between 1784-1786 in Wilkes County, or vicinity. We know my Vannoy cousin Y DNA matches with other men from the Vannoy line upstream of John Francis Vannoy, the known father of four sons in Wilkes County, NC and the first (and only) Vannoy to move from New Jersey to that part of North Carolina.

Therefore, we know who the candidates are to be Elijah’s father, but the connection in the yellow box is missing. Many Wilkes County records have gone missing over the years and births were not recorded in that timeframe. The records from neighboring Ashe County where Daniel Vannoy lived burned during the Civil War, although some records did survive. In other words, the records are rather like Swiss cheese. Welcome to genealogy in the south.

Which of John Francis Vannoy’s four sons does Elijah descend from?

Let’s see what we can discover.

Contact Matches and Ask for Help

The first thing I would do is to ask for assistance from your surname matches.

Let’s say that you match a known descendant of each of these four men, meaning each of John Francis Vannoy’s sons. Ask each person if they know where the male Vannoy descendants of each son went along with any documentation they might have. If your ancestor, Elijah in this case, is not found in the same location as the sons, geography may be your friend.

In our case, we know that Francis Vannoy migrated to Knox County, Kentucky, but that was after he signed for his daughter’s marriage in Wilkes Co., NC in 1812. It was also about this time that Elijah Vannoy migrated to Claiborne County, TN, in the same direction, but not the same location. The two locations are an hour away by car today, separated by mountains and the Cumberland Gap, a nontrivial barrier.

We also know that Nathaniel Vannoy left a Bible that did not list Elijah as one of his children, but with a gap large enough to possibly encompass another child. If you’re thinking to yourself, “Who would leave a child’s birth out of the Bible?,” I though the same thing until I encountered it myself personally in another line. However, the Bible record does make Nathaniel a less likely father candidate, despite a persistent rumor that Nathaniel was Elijah’s father.

Our only other clues are some tax records recording the number of children in the household of various ages, but none are conclusive. None of these men had wills.

Y DNA Genetic Distance

Your Y DNA matches will show how many mutations you are from them at a particular marker level.

Please note that you can click to enlarge any graphic.

The number of mutations between two men is called the genetic distance.

The rule of thumb is that the more mutations, the further back in time the common ancestor. The problem is, the rule of thumb doesn’t always work. DNA mutates when it darned well pleases, not on any clock that we can measure with that degree of accuracy – at least not accurately enough to tell which of 4 sons a man descends from – unless that line has incurred a defining mutation between the ancestor and the current generation. We call those line marker mutations. To determine the mutation history, you need multiple men from each line to have tested.

You can read more about Y DNA matching in the article, Concepts – Y DNA Matching and Connecting with your Paternal Ancestor.

Check Autosomal DNA Tests

Next, check to see if your Y DNA matches from all Vannoy lines have also taken the autosomal Family Finder test, noted as FF, which shows matches from all ancestral lines, not just the paternal line.

You can see in the match list above that not many have taken the Family Finder test. Ask if they would be willing to upgrade. Be prepared to pay if need be – because you are, after all, the one with the “problem” to solve.

Generally, I simply offer to pay. It’s well worth it to me, and given that paper records don’t exist to answer the question – a DNA test under $100 is cheap. Right now, Family Finder tests are on sale for $69 until the end of the month.

Check for Intermarriage

While you’re waiting for autosomal DNA results, check the pedigrees for all for lines involved to see if you are otherwise related to these men or their wives.

For example, in Andrew Vannoy’s wife’s line and Elijah Vannoy’s wife’s line, we have a common ancestor. George Shepherd and Elizabeth Mary Angelique Daye are common to both lines, and John Shepherd’s wife is unknown, so we have one known problem and one unknown surname.

You can tell already that this could be messy, because we can’t really use Andrew Vannoy’s wife’s line to search for matches because Elijah’s line is likely to match through Andrew’s wife since Susannah Shepherd and Lois McNiel share a common lineage. Rats!

We’ll mark these in red to remind ourselves.

Check Advanced Matching

Family Tree DNA provides a wonderful tool that allows you to compare matches of different kinds of DNA. The Advanced Matching tab is found under “Tools and Apps” under the myFTDNA tab at the upper left.

In this case, I’m going to use the Advanced Match feature to see which of my Vannoy cousin’s Y matches at 37 markers, within the Vannoy DNA project, also match him autosomally.

This report is particularly nice, because it shows number of Y mutations, often indicating distance to a common ancestor, as well as the estimated autosomal relationship range.

You can see in this case that the first Vannoy male, “A,” is a close match both on Y DNA and autosomally, with 1 mutation difference and falling in the 2^nd to 4^th cousin range, as compared to the second Vannoy male, “D,” who is 3 mutations different and falls into the 4^th to remote cousin range.

Not every Vannoy male may have joined the Vannoy project, so you’ll want to run this report a second time, replacing the Vannoy project search criteria with “The Entire Database.”

Unfortunately, not everyone that I need has taken the Family Finder test, so I’ll be contacting a few men, asking if I can sponsor their upgrades.

Let’s move on to our next tactic, using the wives’ surnames.

Search Utilizing the Wife’s Surname

We already know that we can’t rely on the Shepherd surname, so we’ll have to utilize the surnames of the other three wives:

Millicent Henderson – parents Thomas Henderson born circa 1730 Virginia, died 1806 Laurens, SC, wife Frances, surname unknown
Elizabeth Ray (Raye) – parents William Ray born circa 1725/1730 Herdford, England, died 1783 Wilkes Co., NC (the portion now Ashe Co.,) wife Elizabeth Gordon born circa 1783 Amherst Co., VA and died 1804 Surry Co., NC
Sarah Hickerson – parents Charles Hickerson born circa 1725 Stafford Co., VA, died before 1793 Wilkes Co., NC, wife Mary Lytle

Utilizing the Family Finder match search function, I’m going to search for matches that include the wives surnames, but are NOT descended from the Vannoy line.

Hickerson produced no non-Vannoy matches utilizing the matches of my first Vannoy cousin, but Henderson is another matter entirely.

Since the Henderson line would be on my cousin’s father’s side, the matches that are most relevant are the ones phased to his paternal line, those showing the blue person icon.

The surname that you have entered as the search criteria will show as blue in the Ancestral Surname list, at far right, and other matching surnames will show as black. Please note that this includes surnames from ANY person in the match’s tree if they have uploaded a Gedcom file, not just surnames of direct ancestral lines. Therefore, if the match has a tree, it’s important to click on the pedigree icon and search for the surname in question. Don’t assume.

Altogether, there are 76 Henderson matches, of which 17 are phased to his paternal line. You’ll need to review each one of at least the 17. Personally, I would painstakingly review each one of the 76. You never know where a shred of information will be found.

Please note, finding a match with a common surname DOES NOT MEAN THAT YOU MATCH THIS PERSON THROUGH THAT SURNAME. Even finding a person with a common ancestor doesn’t mean that you both descend from that ancestor. You may have a second common ancestor. It means that you have more work to do, as proof, but it’s the beginning you need.

Of course, the first thing we need to do is eliminate any matches who also descend from a Vannoy, because there is no way to know if the matching DNA is through the Vannoy or Henderson lines. However, first, take note of how that person descends from the Vannoy line.

You can see your matches entire surname list by clicking on their profile picture.

The surname, Ray, is more difficult, because the search for Ray also returns names like Bray and Wray, as well as Ray.

But Wait – There’s a Happy Ending!

If you’re thinking, “this is a lot of work,” yes, it is.

Yes, you are absolutely going to do the genealogy of the wives’ lines so you can recognize if and how your matches might connect.

I enter the wives’ lines into my genealogy software and then I search for the ancestors found in my matches trees to see if they descend from that line.

One tip to make this easier is to test multiple people in the same line – regardless of whether they are males or carry the desired surname. They simply need to be descendants – that’s the beauty of autosomal DNA and why I carry kits with me wherever I go. And yes, I’m really serious about that!

When you have multiple testers from the same line, you can utilize each test independently, searching for each surname in the Family Finder results. Then, from the surname match list, select a sibling or other close relative with that same surname in their list, then choose the ICW feature. This allows you to see who both of those people match who also carries the Henderson surname in their surname list.

Not successful with that initial cousin’s match results – like I wasn’t with Hickerson?

Rinse and repeat, with every single person who you can find who has descended from the line in question. I started the process over again with a second cousin and a Hickerson search.

About the time you’re getting really, really tired of looking at all of those trees, extending the branches of other people’s lines, and are about to give up and go to bed because it’s 3 AM and you’re discouraged, you see something like this:

Yep, it’s good old Charles Hickerson and Mary Lytle. I could hardly believe my eyes!!! This Hickerson match to a cousin in my Vannoy line descends from Charles Hickerson’s son, Joshua.

All of a sudden…it’s all worthwhile! Your fatigue is gone, replaced by adrenalin and you couldn’t sleep now if your life depended on it!

Using the ICW (in common with feature) to find additional known cousins who match the person with Charles Hickerson and Mary Lytle in their tree, I found a total of three Vannoy cousins with significant matches.

Using the chromosome browser to compare, I’ve confirmed that one segment is a triangulated match of 12.69 cM (blue) on chromosome 2.

You can read more about triangulation in the article, Concepts – Why Genetic Genealogy and Triangulation? as well as the article, Concepts – Match Groups and Triangulation.

Do I wish I had more than three people in my triangulation group? Yes, of course, but with a match of this size triangulated between cousins and a Hickerson descendant who is a 30 year genealogist, sporting a relatively complete tree and no other common lines, it’s a great place to begin digging deeper! This isn’t the end, but a new beginning!

After obsessively digging through the matches of every Elijah Vannoy descended cousin I can find (sleep is overrated anyway) and whose account I have access to, I have now discovered matches with four additional people who have no other common lines with the Vannoy cousins and who descend from Charles Hickerson and Mary Lytle through sons David and Joseph Hickerson. I can’t tell if they triangulate without access to accounts that I don’t have access to, so I’ve sent e-mails requesting additional information.

WooHoo Happy Day!!! There’s a really big crack in the brick wall and I’ve just witnessed the sunrise of a beautiful, amazing day.

I think Elijah’s parents are…drum roll…Daniel Vannoy and Sarah Hickerson!

Which walls do you need to fall and how can you use this technique?

______________________________________________________________

Disclosure

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Services

Genealogy Research

Legacy Tree Genealogists for genealogy research

Concepts – Why Genetic Genealogy and Triangulation?

Posted on May 16, 2017 by Roberta Estes

One of the questions often asked is why triangulation in genetic genealogy is so important.

Before I answer that, let’s take a look at why genealogists use autosomal DNA for genetic genealogy in the first place.

Why Genetic Genealogy?

Aside from ethnicity testing, genetic genealogists utilize autosomal DNA testing to further their genealogical research or confirm the research they have already performed. Genetic genealogy cannot stand alone on DNA evidence, but must include traditional genealogical research. DNA is simply another tool in the genealogist’s tool box – albeit a critical one.

There are three established primary vendors in this field, Family Tree DNA, Ancestry and 23andMe, plus a few newcomers. All three vendors offer autosomal DNA tests utilized by genetic genealogists in various ways. If you want to learn more about the differences between these vendors’ offerings, please read the article, “Which DNA Test is Best?”

In order to achieve genealogical goals, there are four criteria that need to be met. All are required to achieve triangulation which is the only way to confirm a genealogical ancestral match to a specific ancestor.

DNA Matching – The tester’s DNA matches that of other testers at the company where they tested, or at GedMatch. All three vendors provide matching information, along with GedMatch, a third-party tool utilized by genetic genealogists.

Family Tree DNA assigns matches to either maternal, paternal or both sides of the tester’s tree based on connecting the DNA of relatives, up through third cousins, who have tested to their appropriate location in the tester’s tree.

In the example above, you can see the individuals linked to my tree include my mother with her Family Finder test, plus her two first cousins, Donald and Cheryl Ferverda who have also tested.

Ancestor Matching – The testers identify a common ancestor or ancestral line based on their previous work, aka, genealogy and family trees. In the example above, the common ancestors are the parents of the brothers, John and Roscoe Ferverda. Identifying a common ancestor is an easy task with known close relatives, but becomes more challenging the more distant the common ancestor.

Of the vendors, 23andMe does not have a Gedcom upload or ability for testers to display trees and for the vendor to utilize to match surnames, although they can link to external trees. Ancestry provides “tree matching,” shown above, and Ancestry and Family Tree DNA, shown below, both provide surname matching.

Segment Matching – Utilizing chromosome browsers or downloaded match lists including segment information to identify actual DNA segments that match other testers.

Family Tree DNA’s chromosome browser is shown above.

Each individual tester will have two groups of matches on the same segment, one group from their mother’s side of the tree and one from their father’s side of the tree. Each tester carries DNA inherited from both parents on two different “sides” of each chromosome. You can read more about that in the article, One Chromosome, Two Sides, No Zipper – ICW and the Matrix.

Of the three vendors, Ancestry does not provide segment matching, a chromosome browser, nor any segment information, so testers cannot perform this step at Ancestry.

23andMe does provide this information, but each tester must individually “opt in” to data sharing, and many do not. If testers do not globally “opt in” they must authorize sharing individually for every match, so testers will not be able to see the chromosome segment information for many 23andMe matches. In my case, only about 60% are sharing.

Family Tree DNA provides a chromosome browser, the file download capability with segment information, and everyone authorizes sharing of information when they initially test – so there is no opt-in confusion.

Ancestry and 23andMe raw DNA data files can be transferred to both Family Tree DNA and GedMatch where chromosome browsers and other tools are available. For more information about transferring files, please read Autosomal DNA Transfers – Which Companies Accept Which Tests?

Triangulation – The process used to combine all three of the above steps in order to assign specific segments of the tester’s DNA to specific ancestors, by virtue of:

The tester’s DNA matching the DNA of other testers on a specific segment.
Identifying that the individuals who match the tester on that segment also match each other. This is part of the methodology employed to group the testers matches into two groups, the maternal and paternal groupings.
Identifying which ancestor contributed that segment to all of the people who match the tester and each other on that same segment.

In order for a group of matches to triangulate, they must match each other on the same segment of DNA and they must all share a common ancestor.

Triangulation is part DNA, meaning the inheritance, part technology, meaning the ability to show that all testers in a match group all match each other and on the same segment, and part genealogy, meaning the ability to identify the common ancestor of the group of individuals.

The following chart shows a portion of my match download file on chromosome 5 from Family Tree DNA.

As you can see, these matches all cover significant portions of the same segment on chromosome 5.

Without further investigation, we know that I match all of these people, but we don’t know what that information is telling us about my genealogy. We don’t know who matches each other, and we can’t tell which people are from my mother’s and father’s sides. We also don’t know who the common ancestor is or common ancestors are.

However, looking at the trees of the individuals involved, or contacting them for further information, and/or recognizing known cousins from a specific line all combine to contribute to the identification of our common ancestors.

Below is the same spreadsheet, now greatly enriched after my genealogy work is applied to the DNA matches in two additional columns.

I’ve colored my triangulated groups pink for my mother’s side and blue for my father’s side.

In this case, I also have access to my cousins’ DNA match results, so I can view their matches as well, looking for common matches on my match list.

One of the reasons genealogists always suggest testing older family members and as many cousins as possible is because triangulation becomes much easier with known cousins from particular lines to point the way to the common ancestor. In this case, one cousin, Joe, is from my mother’s side and one, Lou, is from my father’s side.

By looking at my matches’ genealogy, I’ve now been able to assign this particular segment on chromosome 5, on my mother’s side to ancestors Johann Michael Miller and his wife Susanna Berchtol. The same segment, on my father’s side is inherited from Charles Dodson and his wife, Ann, last name unknown.

In order to achieve triangulation, the common ancestor must be determined for the match group. Once triangulation is achieved, descent from the common ancestor is confirmed.

Unless you are dealing with very close known relatives, like the Ferverda first cousins, there is no other way to prove a genetic connection to a specific ancestor.

At Family Tree DNA, I can utilize the chromosome browser and the ICW and matrix tools to determine which of this group matches each other. At 23andMe, I can utilize their shared DNA matching tool. This information can then be recorded in my DNA spreadsheet, as illustrated above.

Triangulation cannot be achieved at Ancestry or utilizing their tools. Ancestry’s DNA Circles provide extended match groups, indicating who matches whom for a particular ancestor shown in a tester’s tree, but do not indicate that the matches are on the same segment. Circles do not guarantee that Circle members are matching on DNA from that ancestor, only that they do match and show a common ancestor in their tree. The third triangulation step of segment matching is missing. Ancestry does not provide segment information in any format, so Ancestry customers who want to triangulate can either retest elsewhere or download their data files to either Family Tree DNA or GedMatch for free.

Summary

Before the advent of genetic genealogy, genealogists had to take it on faith that the paper trail was accurate, and that there was no misattributed parentage – either through formal or informal adoption or hanky-panky. That’s not the case anymore.

Today, DNA through triangulation can prove ancestry for groups of people to a common ancestor by identifying segments that have descended from that ancestor and are found in multiple descendants today.

Of course, the next step is to break down those remaining brick walls. For example, what is the birth name of Ann, wife of Charles Dodson, whose surname is unknown? Logically, the DNA descended from a couple, meaning Charles and Ann, contains DNA from both individuals. We don’t know if that segment on chromosome 5 is from Ann, Charles, or parts from both, BUT, if we begin to see a further breakdown to another, unknown family line among the Charles and Ann segments, that might be a clue.

One day, in the future, we’ll be able to identify our unknown family lines through DNA matches and other people’s triangulation. That indeed, is the Holy Grail.

Additional Resources

If you’d like to read more specific information about autosomal DNA matching and triangulation, be sure to read the links in the article, above. The following articles may be of interest as well:

If you think you might come up short, because you have only one known cousin who has tested, well, think again.

Just One Cousin

Here’s wishing you lots of triangulated matches!!!

______________________________________________________________

Disclosure

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Services

Genealogy Research

Legacy Tree Genealogists for genealogy research

Concepts – The Faces of Endogamy

Posted on March 10, 2017 by Roberta Estes

Recently, while checking Facebook, I saw this posting from my friend who researches in the same Native admixed group of families in North Carolina and Virginia that I do. Researchers have been trying for years to sort through these interrelated families. As I read Justin’s post, I realized, this is a great example of endogamy and often how it presents itself to genealogists.

I match a lot of people from the Indian Woods [Bertie County, NC] area via DNA, with names like Bunch, Butler, Mitchell, Bazemore, Castellow, and, of course, Collins. While it’s hard to narrow in on which family these matching segments come from, I can find ‘neighborhoods’ that fit the bill genetically. This [census entry] is from near Quitsna in 1860. You see Bunch, Collins, Castellow, Carter, and Mitchell in neighboring households.

Which begs the question, what is endogamy, do you have it and how can you tell?

Definition

Endogamy is the practice or custom or marrying within a specific group, population, geography or tribe.

Examples that come to mind are Ashkenazi Jews, Native Americans (before European and African admixture), Amish, Acadians and Mennonite communities.

Some groups marry within their own ranks due to religious practices. Jewish, Amish and Mennonite would fall under this umbrella. Some intermarry due to cultural practices, such as Acadians, although their endogamy could also partly be attributed to their staunch Catholic beliefs in a primarily non-Catholic region. Some people practice endogamy due to lack of other eligible partners such as Native Americans before contact with Europeans and Africans. People who live on islands or in villages whose populations were restricted geographically are prime candidates for endogamy.

In the case of Justin’s group of families who were probably admixed with Native, European and African ancestors, they intermarried because there were socially no other reasonable local options. In Virginia during that timeframe, mixed race marriages were illegal. Not only that, but you married who lived close by and who you knew – in essence the neighbors who were also your relatives.

Endogamy and Genetic Genealogy

In some cases, endogamy is good news for the genealogist. For example, if you’re working with Acadian records and know which Catholic church your ancestors attended. Assuming those church records still exist, you’re practically guaranteed that you’ll find the entire family because Acadians nearly always married within the Acadian community, and the entire Acadian community was Catholic. Catholics kept wonderful records. Even when the Acadians married a Native person, the Native spouse is almost always baptized and recorded with a non-Native name in the Catholic church records, which paved the way for a Catholic marriage.

In other cases, such as Justin’s admixed group, the Brethren who notoriously kept no church records or the Jewish people whose records were largely destroyed during the Holocaust, endogamy has the opposite effect – meaning that actual records are often beyond the reach of genealogists – but the DNA is not.

It’s in cases like this that people reach for DNA to help them find their families and connections.

What Does Endogamy Look Like?

If you know nothing about your heritage, how would you know whether you are endogamous or not? What does it look like? How do you recognize it?

The answer is…it depends. Unfortunately, there’s no endogamy button that lights up on your DNA results, but there are a range of substantial clues. Let’s divide up the question into pieces that make sense and look at a variety of useful tools.

Full or Part?

First of all, fully and partly endogamous ancestry, and endogamy from different sources, has different signs and symptoms, so to speak.

A fully endogamous person, depending on their endogamy group, may have either strikingly more than average autosomal DNA matches, or very few.

Another factor will be geography, where you live, which serves to rule out some groups entirely. If you live in Australia, your ancestors may be European but they aren’t going to be Native American.

How many people in your endogamous group that have DNA tested is another factor that weighs very heavily in terms of what endogamy looks like, as is the age of the group. The older the group, generally the more descendants available to test although that’s not always the case. For example warfare, cultural genocide and disease wiped out many or most of the Native population in the United States, especially east of the Mississippi and particularly in the easternmost seaboard regions.

Because of the genocide perpetrated upon the Jewish people, followed by the scattering of survivors, Jewish descendants are inclined to test to find family connections. Jewish surnames may have been changed or not adopted in some cases until late, in the 1800s, and finding family after displacement was impossible in the 1940s for those who survived.

Let’s look at autosomal DNA matches for fully and partly endogamous individuals.

Jewish people, in particular Ashkenazi, generally have roughly three times as many matches as non-endogamous individuals.

Conversely, because very few Native people have tested, Native testers, especially non-admixed Native individuals, may have very few matches.

It’s ironic that my mother, the last person listed, with two endogamous lines, still has fewer matches than I do, the first person listed. This is because my father has deep colonial roots with lots of descendants to test, and my mother has recent immigration in her family line – even though a quarter of her ancestry is endogamous.

To determine whether we are looking at endogamy, sometimes we need to look for other clues.

There are lots of ways to discover additional clues.

Surnames

Is there a trend among the surnames of your matches?

At the top of your Family Finder match page your three most common surnames are displayed.

A fully endogamous Jewish individual’s most common surnames are shown above. If you see Cohen among your most common surnames, you are probably Jewish, given that the Kohanim have special religious responsibilities within the Jewish faith.

Of course, especially with autosomal DNA, the person’s current surname may not be indicative, but there tends to be a discernable pattern with someone who is highly endogamous. When someone who is fully endogamous, such as the Jewish population, intermarries with other Jewish people, the surnames will likely still be recognizably Jewish.

Our Jewish individual’s first matching page, meaning his closest matches, includes the following surnames:

Cohen
Levi
Bernstein
Kohn
Goldstein

The Sioux individual only has 137 matches, but his first page of matches includes the following surnames:

Sunbear
Deer With Horns
Eagleman
Yelloweyes
Long Turkey
Fire
Bad Wound
Growing Thunder

These surnames are very suggestive of Native American ancestry in a tribe that did not adopt European surnames early in their history. In other words, not east of the Mississippi.

At Family Tree DNA, every person has the opportunity to list their family surnames and locations, so don’t just look at the tester’s surname, but at their family surnames and locations too. The Ancestral Surname column is located to the far right on the Family Finder matches page. If you can’t see all of the surnames, click on the person’s profile picture to see their entire profile and all of the surnames they have listed.

Please note that you can click to enlarge all graphics.

If you haven’t listed your family surnames, now would be a good time. You can do this by clicking on the orange “Manage Personal Information” link near your profile picture on the left of your personal page.

The orange link takes you to the account settings page. Click on the Genealogy tab, then on surnames. Be sure to click the orange “save” when you are finished.

Partial Endogamy

Let’s take a look at a case study of someone who is partially endogamous, meaning that they have endogamous lines, but aren’t fully endogamous. My mother, who is the partially endogamous individual with 1231 matches is a good example.

Mother is a conglomeration of immigrants. Her 8 great-grandparents break down as follows:

In mother’s case, a few different forces are working against each other. Let’s take a look.

The case of recent immigration from the Netherlands, in the 1850s, would serve to reduce mother’s matches because there has been little time in the US for descendants to accrue and test. Because people in the Netherlands tend to be very reluctant about DNA testing, very few have tested, also having the effect of reducing her number of matches.

Mother’s Dutch ancestors were Mennonites, an endogamous group within the Netherlands, which would further reduce her possibilities of having matches on these lines since she would be less likely to match the general population and more likely to match individuals within the endogamous group. If people from the Mennonite group tested, she would likely match many within that group. In other words, for her to find Dutch matches, people descended from the endogamous Dutch Mennonite population would need to test. At Family Tree DNA, there is a Low Mennonite Y DNA and Anabaptist autosomal DNA project both, but these groups tend to attract the Mennonites that migrated to Russia and Poland, not the group that stayed in the Netherlands. Another issue, at least in mother’s case, is that her Mennonite relatives “seem” to have been later converts, not part of the original Mennonite group – although it’s difficult to tell for sure in the records that exist.

Mother’s Kirsch and Drechsel ancestors were also recent immigrants in the 1850s, from Germany, with very few descendants in the US today. The villages from where her Kirsch ancestors immigrated, based on the church records, did tend to be rather endogamous. However, that endogamy would only have reached back about 200 years, as far as the 30 Years’ War when that region was almost entirely, if not entirely, depopulated. So while there was recent endogamy, there (probably) wasn’t deep endogamy. Of course, it would require someone from those villages to test so mother could have matches before endogamy can relevant. DNA testing is not popular in Germany either.

Because of recent immigration, altogether one half of mother’s heritage would reduce her number of matches significantly. Recent immigrants simply have fewer descendants to test.

On the other hand, mother’s English line has been in the US for a long time, some since the Mayflower, so she could expect many matches from that line, although they are not endogamous. If you’re thinking to yourself that deep colonial ancestry can sometime mimic endogamy in terms of lots of matches, you’re right – but still not nearly to the level of a fully endogamous Jewish person.

Mother’s Acadian line has been settled in North America in Nova Scotia since the early 1600s, marrying within their own community, mixing with the Native people and then scattering in different directions after 1755 when they were forcibly removed. Acadians, however, tended to remain in their cultural groups, even after relocation. Many Acadian descendants DNA test and all Acadians descend from a limited and relatively well documented original population. That level of documentation is very unusual for endogamous groups. Acadian surnames are well known and are French. The best Acadian genealogical resource in is Karen Theriot’s comprehensive tree on Rootsweb in combination with the Mothers of Acadia DNA project at Family Tree DNA. I wish there was a similar Fathers of Acadia project.

Mother’s Brethren line is much less well documented due to a lack of church records. The Brethren community immigrated in the early 1700s from primarily Switzerland and Germany, was initially relatively small, lived in clusters in specific areas, traveled together and did not marry outside the Brethren faith. Therefore, Brethren heritage and names also tend to be rather specific, but not as recognizable as Acadian names. After all, the Brethren were German/Swiss and in mother’s case, she also has another 1/4^th of her heritage that are recently immigrated Germans – so differentiating one German group from the other can be tricky. The only way to tell Brethren matches from other German matches is that the Brethren also tend to match each other.

In Common With

If you notice a group of similar appearing surnames, use the ICW (in common with) tool at Family Tree DNA to see who you match in common with those individuals. If you find that you match a whole group of people with similar surnames or geography, contact your matches and ask if they know any of the other matches and how they might be related. I always recommend beginning with your closest matches because your common ancestor is likely to be closer in time than people who match you more distantly.

In the ICW match example below, all of the matches who do show ancestral surnames include Acadian surnames and/or locations.

Acadians, of course, became Cajuns in Louisiana where one group settled after their displacement in Nova Scotia. The bolded surnames match surnames on the tester’s surname list.

The ICW tools work particular well if you know of or can identify one person who matches you within a group, or simply on one side of your family.

Don Worth’s Autosomal DNA Segment Analyzer is an excellent tool to genetically group your matches by chromosome. It’s then easy to use the chromosome browser at Family Tree DNA to see which of these people match you on the same segments. These tools work wonderfully together.

The group above is an Acadian match group. By hovering over the match names, you can see their ancestral surnames which make the Acadian connection immediately evident.

The Matrix

In addition to seeing the people you match in common with your matches by utilizing the ICW tool at Family Tree DNA, you can also utilize the Matrix tool to see if your matches also match each other. While this isn’t the same as triangulation, because it doesn’t tell you if they match each other on the same exact segment, it’s a wonderful tool, because in the absence of cooperation or communication from your matches to determine triangulation between multiple people, the Matrix is a very good secondary approach and often predicts triangulation accurately.

In the Matrix, above, the blue boxes indicates that these individuals (from your match list) also match each other.

For additional information on various autosomal tools available for your use, click here to read the article, Nine Autosomal Tools at Family Tree DNA.

MyOrigins

Everyone who takes the Family Finder test also receives their ethnicity estimates on the MyOrigins tab.

In the case of our Jewish friend, above, his MyOrigins map clearly shows his endogamous heritage. He does have some Middle Eastern region admixture, but I’ve seen Ashkenazi Jewish results that are 100% Ashkenazi Jewish.

The same situation exists with our Sioux individual, above. Heavily Native, removing any doubt about his ancestry.

However, mother’s European admixture blends her MyOrigins results into a colorful but unhelpful European map, at least in terms of determining whether she is endogamous or has endogamous lines.

European endogamous admixture, except for Jewish heritage, tends to not be remarkable enough to stand out as anything except European heritage utilizing ethnicity tools. In addition, keep in mind that DNA testing in France for genealogy is illegal, so often there is a distinct absence in that region that is a function of the lack of testing candidates. Acadians may not show up as French.

Ethnicity testing tends to be excellent at determining majority ethnicity, and determining differences between continental level ethnicity, but less helpful otherwise. In terms of endogamy, Jewish and Native American tend to be the two largest endogamous groups that are revealed by ethnicity testing – and for that purpose, ethnicity testing is wonderful.

Y and Mitochondrial DNA and Endogamy

Autosomal tools aren’t the only tools available to the genetic genealogist. In fact, if someone is 100% endogamous, or even half endogamous, chances are very good that either the Y DNA for males on the direct paternal line, or the mitochondrial DNA for males and females on the direct matrilineal line will be very informative.

On the pedigree chart above, the blue squares represent the Y DNA that the father contributes to only his sons and the red circles represent the mitochondrial DNA (mtDNA) that mothers contribute to both genders of their children, but is only passed on by the females.

By utilizing Y and mtDNA testing, you can obtain a direct periscope view back in time many generations, because the Y and mitochondrial DNA is preserved intact, except for an occasional mutation. Unlike autosomal DNA, the DNA of the other parent is not admixed with the Y or mitochondrial DNA. Therefore, the DNA that you’re looking at is the DNA of your ancestors, generations back in time, as opposed to autosomal DNA which can only reliably reach back 5 or 6 generations in terms of ethnicity because it gets halved in every generation and mixed with the DNA of the other parent.

With autosomal DNA, we can see THAT it exists, but not who it came from. With Y and mtDNA DNA, we know exactly who in your tree that specific DNA came from

We do depend on occasional Y and mtDNA mutations to allow our lines to accrue enough mutations to differentiate us from others who aren’t related, but those mutations accrue very slowly over hundreds to thousands of years.

Our “clans,” over time, are defined by haplogroups and both our individual matches and our haplogroup or clan designation can be very useful. Your haplogroup will indicate whether you are European, Jewish, Asian, Native American or African on the Y and/or mtDNA line.

In cases of endogamous groups where the members are known to marry only within the group, Y and mtDNA can be especially helpful in identifying potential families of origin. This is evident in the Mothers of Acadia DNA project as well a particular brick wall I’m working on in mother’s Brethren line. Success, of course, hinges on members of that population testing their Y or mtDNA and being available for comparison.

Always test your Y (males only) and mitochondrial DNA (males and females.) You don’t know what you don’t know, and sometimes those lines may just hold the key you’re looking for. It would be a shame to neglect the test with the answer, or at least a reasonably good hint! Stories of people discovering their ethnic heritage, at least for that line, by taking a Y or mtDNA test are legendary.

Jewish Y and Mitochondrial DNA

Fortunately, for genetic genealogists, Jewish people carry specific sub-haplogroups that are readily identified as Jewish, although carrying these subgroups don’t always mean you’re Jewish. “Jewish” is a religion as well as a culture that has been in existence as an endogamous group long enough in isolation in the diaspora areas to develop specific mutations that identify group members. Furthermore, the Jewish people originated in the Near East and are therefore relatively easy, relative to Y and mtDNA, to differentiate from the people native to the regions outside of the Near East where groups of Jewish people settled.

The first place to look for hints of your heritage is your main page at Family Tree DNA. First, note your haplogroups and any badges you may have in the upper right hand corner of your results page.

In this man’s case, the Cohen badge is this man’s first clue that he matches or closely matches the known DNA signature for Jewish Cohen men.

Both Y DNA and mitochondrial DNA results have multiple tabs that hold important information.

Two tabs, Haplogroup Origins and Ancestral Origins are especially important for participants to review.

The Haplogroup Origins tab shows a combination of academic research results identifying your haplogroup with locations, as well as some Ancestral Origins mixed in.

A Jewish Y DNA Haplogroup Origins page is shown above.

The Ancestral Origins page, below, reflects the location where your matches SAY their most distant direct matrilineal (for mtDNA) or patrilineal (for Y DNA) ancestors were found. Clearly, this information can be open to incorrect interpretation, and sometimes is. For example, people often don’t understand that “most distant maternal ancestor” means the direct line female on your mother’s mother’s mother’s side. However, you’re not looking at any one entry. You are looking instead for trends.

The Ancestral Origins page for a Jewish man’s Y DNA is shown above.

The Haplogroup Origins page for Jewish mitochondrial DNA, below, looks much the same, with lots of Ashkenazi entries.

The mitochindrial Ancestral Origins results, below, generally become more granular and specific with the higher test levels. That’s because the more general results get weeded out a higher levels. Your closest matches at the highest level of testing are the most relevant to you, although sometimes people who tested at lower levels would be relevant, if they upgraded their tests.

Native American Y and Mitochondrial DNA

Native Americans, like Jewish people, are very fortunate in that they carry very specific sub-haplogroups for Y and mitochondrial DNA. The Native people had a very limited number of founders in the Americas when they originally arrived, between roughly 10,000 and 25,000 years ago, depending on which model you prefer to use. Descendants had no choice but to intermarry with each other for thousands of years before European and African contact brought new genes to the Native people.

Fortunately, because Y and mtDNA don’t mix with the other parents’ DNA, no matter how admixed the individual today, testers’ Y and mtDNA still shows exactly the origins of that lineage.

Native American Y DNA shows up as such on the Haplogroup Origins and Ancestral Origins tabs, as illustrated below.

The haplogroup assigned is shown along with a designation as Native on the Haplogroup Origins and Ancestral Origins pages. The haplogroup is assigned through DNA testing, but the Native designation and location is entered by the tester. Do be aware that some people record the fact that their “mother’s side” or “father’s side” is reported to have a Native ancestor, which is not (necessarily) the same as the matrilineal or patrilineal line. Their “mother’s side” and “father’s side” can have any number of both male and female ancestors.

If the tester’s haplogroup comes back as non-Native, the erroneous Native designation shows up in their matches Ancestral Origins page as “Native,” because that is what the tester initially entered. I wrote about this situation here, but there isn’t much that can be done about this unless the tester either realizes their error or thinks to go back and change their designation from Native American when they realize the DNA does not support the family story, at least not on this particular line line. Erroneous labeling applies to both Y and mtDNA.

Native Y DNA falls within a subset of haplogroups C and Q. However, most subgroups of C and Q are NOT Native, but are European or Asian or in one case, a subgroup of haplogroup Q is Jewish. This does NOT means that the Jewish people and the Native people are related within many thousands of years. It means they had a common ancestor in Asia thousands of years ago that gave birth to both groups. In essence, one group of the original Q moved east and eventually into the Americas, and one moved west, winding up in Europe. Today, mutations (SNPs) have accrued to each group that very successfully differentiate them from one another. In order to determine whether your branch of C or Q is Native, you must take additional SNP tests which further identify your haplogroup – meaning which branch of haplogroup C or Q that you belong to.

Native Americans Y-DNA, to date, must fall into a subset of haplogroup C-P39, a subgroup of C-M217 or Q-M3, Q-M971/Z780 or possibly Q-B143 (ancient Saqquq in Greenland), according to The study of human Y chromosome variation through ancient DNA. Each of these branches also has sub-branches except for Q-B143 which may be extinct. This isn’t to say additional haplogroups or sub-haplogroups won’t be discovered in the future. In fact, haplogroup O is a very good candidate, but enough evidence doesn’t yet exist today to definitively state that haplogroup O is also Native.

STR marker testing, meaning panels of markers from 12-111, provides all participants with a major haplogroup estimate, such as C or Q. However, to confirm the Y DNA haplogroup subgroup further down the tree, one must take additional SNP testing. I wrote an article about the differences between STR markers and SNPs, if you’d like to read it, here and why you might want to SNP test, here.

Testers can purchase individual SNPs, such as the proven Native SNPs, which will prove or disprove Native ancestry, a panel of SNPs which have been combined to be cost efficient (for most haplogroups), or the Big Y test which scans the entire Y chromosome and provides additional matching.

When financially possible, the Big Y is always recommended. The Big Y results for the Sioux man showed 61 previously unknown SNPs. The Big Y test is a test of discovery, and is how we learn about new branches of the Y haplotree. You can see the most current version of the haplogroup C and Q trees on your Family Tree DNA results page or on the ISOGG tree.

Native mitochondrial DNA can be determined by full sequence testing the mitochondrial DNA. The mtPlus test only tests a smaller subset of the mtDNA and assigns a base haplogroup such as A. To confirm Native ancestry, one needs to take the full sequence mitochondrial test to obtain their full haplogroup designation which can only be determined by testing the full mitochondrial sequence.

Native mitochondrial haplogroups fall into base haplogroups A, B, C, D, X and M, with F as a possibility. The most recent paper on Native Mitochondrial DNA Discoveries can be found here and a site containing all known Native American mitochondrial DNA haplogroups is here.

Not Native or Jewish

Unfortunately, other endogamous groups aren’t as fortunate as Jewish and Native people, because they don’t have haplogroups or subgroups associated with their endogamy group. However, that doesn’t mean there aren’t a few other tools that can be useful.

Don’t forget about your Matches Maps. While your haplogroup may not be specific enough to identify your heritage, your matches may hold clues. Each individual tester is encouraged to enter the identity of their most distant ancestor in both their Y (if male) and mtDNA lines. Additionally, on the bottom of the Matches Map, testers can enter the location where that most distant ancestor is found. If you haven’t done that yet, this is a good time to do that too!

When looking at your Matches Map, clusters and distribution of your matches most distant ancestor locations are important.

This person’s matches, above, suggest that they might look at the history of Nova Scotia and French immigrants – and the history of Nova Scotia is synonymous with the Acadians but the waterway distribution can also signal French, but not Acadian. Native people are also associated with Nova Scotia and river travel. The person’s haplogroup would add to this story and focus on or eliminate some options.

This second example above, suggests the person look to the history of Norway and Sweden, although their ancestor, indicated by the white balloon, is from Germany. If the tester’s genealogy is stuck in the US, this grouping could be a significant clue relative to either recent or deeper history. Do they live in a region where Scandinavian people settled? What history connects the region where the ancestor is found with Scandinavia?

This third example, above, strongly suggests Acadian, given the matches restricted to Nova Scotia, and, as it turns out, this individual does have strong Acadian heritage. Again, their haplogroup is additionally informative and points directly to the European or Native side of the Acadian heritage for this particular line.

In Summary

Sometimes endogamy is up front and in your face, evident from the minute your DNA results are returned. Other times, endogamous lines in ethnically mixed individuals reveal themselves more subtly, like with my friend Justin. Fortunately, the different types of DNA tests and the different tools at our disposal each contain the potential for a different puzzle piece to be revealed. Many times, our DNA results need to be interpreted with some amount of historical context to reveal the story of our ancestors.

When I first discovered that my mother’s line was Acadian, my newly found cousin said to me, “If you’re related to one Acadian, you’re related to all Acadians.” He wasn’t kidding. For that very reason, endogamous genetic genealogy is tricky at best and frustrating at worst.

When possible, Y and mtDNA is the most definitive answer, because the centuries or millennia or intermarriage don’t affect Y and mtDNA. If you are Jewish or Native on the appropriate lines for testing, Y and mtDNA is very definitive. If you’re not Jewish or Native on your Y or mtDNA lines, check your matches for clues, including surnames, Haplogroup and Ancestral Origins, and your Matches Map.

Consider building a DNA pedigree chart that documents each of your ancestors’ Y and mtDNA for lines that aren’t revealed in your own test. The story of Y and mtDNA is not confused or watered down by admixture and is one of the most powerful, and overlooked, tools in the genealogist’s toolbox.

Autosomal DNA when dealing with endogamy can be quite challenging, even when working with well-documented Acadian genealogy – because you truly are related to everyone. Trying to figure out which DNA segments go with, or descend from, which ancestors reaching back several generations is the ultimate jigsaw puzzle. Often, I work with a specific segment and see how far back I can track that segment in the ancestral line of me and my matches. On good days, we arrive at one common ancestor. On other days, we arrive at dead ends that are not a common ancestor – which means of course that we keep searching genealogically – or pick a different segment to work with.

When working with autosomal DNA of endogamous individuals (or endogamous lines of partially endogamous individuals,) I generally use a larger matching threshold than with non-endogamous, because we already know that these people will have segments that match because they descend from the same populations. In general, I ignore anything below 10cM and often below 15cM if I’m looking for a genealogical connection in the past few generations. If I’m simply mapping DNA to ancestors, then I use the smaller segments, down to either 7 or 5cM. If you want to read more about segments that are identical by chance (also known as false matches,) identical by population and identical by descent (genealogically relevant matches,) click here.

The good news about endogamy is that its evidence persists in the DNA of the population, literally almost forever, as long as that “population” exists in descendants – meaning you can find it! In my case, my Acadian brick wall would have fallen much sooner had I know what endogamy looked like and what I was seeing actually meant.

A perfect example of persistent endogamy is that our Sioux male today, along with other nearly fully Native people, including people from South America, matches the ancient DNA of the Anzick child who died and was buried in Montana 12,500 years ago.

These people don’t just match on small segments, but at contemporary matching levels at Family Tree DNA and GedMatch, both. One individual shows a match of 109 total cM and a single largest segment of DNA at 20.7 cM, a match that would indicate a contemporary relationship of between 3.5 and 4 generations distant – meaning 2nd to 3rd cousins. Clearly, that isn’t possible, but the DNA shared by Anzick Child and that individual today has been intact in the Native population for more than 12,500 years.

The DNA that Anzick Child carried is the same DNA that the Sioux people carry today – because there was no DNA from outside the founder population, no DNA to wash out the DNA carried by Anzick Child’s ancestors – the same exact ancestors of the Sioux and other Native or Native admixed people today.

While endogamy can sometimes be frustrating, the great news is that you will have found an entire population of relatives, a new “clan,” so to speak. You’ll understand a lot more about your family history and you’ll have lots of new cousins!

Endogamy is both the blessing and the curse of genetic genealogy!

______________________________________________________________

Disclosure

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Services

Genealogy Research

Legacy Tree Genealogists for genealogy research

Increasing “In Common With” (ICW) Functionality at Family Tree DNA

Posted on October 26, 2016 by Roberta Estes

You know how Murphy’s Law works, right?

Right after I wrote the article Nine Autosomal Tools at Family Tree DNA, as in minutes later (Ok, that’s probably an exaggeration), Family Tree DNA made a change and the ICW (in common with) tool functioned differently. Murphy lives at my house, I swear!

I initially thought perhaps this was unintended, but it may well be a design change since additional functionality was provided and three months have elapsed.

So regardless of whether or not this change is permanent or will change minutes after I publish this article, I’m providing instructions on how this feature works NOW. If it changes or works differently in the future, I’ll let you know!

In all fairness, it’s the addition of the combination searches, I think, that has caused the confusion. Combo searches are great features and powerful, if you know how to use the functionality correctly for what you want to accomplish.

Let’s take a look at how to utilize the various kinds of searches, individually and in combination, step-by-step.

Example One – Regular “In Common With” Matches

The ICW feature shows you who your matches match in common with you. I’ve signed on as my mother for these examples to illustrate this feature since she is a generation more closely related to these folks than I am.

First, let’s do a normal “in common with” search between my mother and her cousin, Donald. The results of this search will show us everyone that matches mother and Donald, both.

In this example, I’ve done the following:

Selected Donald (who appears on mother’s match list, above) by clicking on the box to the left of his name, which you can see in the “Selected Matches” box at the bottom left indicating he has been selected.
Click on the “in common with” function button above the list of names.

After clicking on the “in common with” button, what I see (above) are all 91 people that match mother in common with Donald, meaning that mother and Donald both match all 91 of these people. This does NOT mean mother and Donald both match them on the same segment(s), only that they do match on at least one segment over the matching threshold.

As you can see, Donald’s name appears now in the “In Common With” box at the top left, along with a total of 91 people who match Donald and my mother both.

To clear any search, meaning all options, at any time, just click on the “reset filter” blue button, located to the right of the “not in common with” function button.

There are multiple features that work together for “in common with” matching and surname searching. Let’s take a look.

Example Two – Surname Searches Plus ICW, Combined

Now, I’ll enter the name Miller in the search box at the upper right. This shows me everyone who has name of Miller, or Miller appearing in their ancestral surnames, who match my mother.

Next, I want to select someone from that Miller match list to see which other people on the Miller match list they match in common with mother. Hey, let’s pick Donald!!!

To utilize a surname search (Miller) and ICW (Donald) together, do the following:

Enter the surname Miller in the search box on the upper right and click enter or the search (blue magnifying glass) icon. Donald appears on the Miller match list, as well as 90 other people. This means that Donald has Miller appearing in his list of ancestral surnames, since his surname is not Miller.
When the match results are returned, select Donald by clicking on the box to the left of his name.
Then click on the “in common with” function box above the list of matches.

I selected Donald, as you can see, by clicking the box beside his name, and his name now appears in the “Selected Matches” box in the lower left hand corner of the page, indicating that he has been selected. However, note that the name Miller still appears in the search box in the upper right hand corner.

Next, I click on the ICW function button, above the list of matches, and I see the following 22 matches that all share the Miller surname or Miller on their list of ancestral names AND match Donald and mother, both. I’m NOT seeing all of mother’s 91 Miller matches, but ONLY her Miller matches that are ALSO “in common with” Donald. This immediately gives me a list of people that are very likely descended from this same ancestral Miller line, and some of them will likely triangulate by utilizing the chromosome browser and other tools described in the Nine Autosomal Tools article.

This combination search is a wonderful feature, but this isn’t always what people want to do. Sometimes you want to first see the Miller matches, then select someone from that match list to run the full ICW tool and see ALL of their matches, not just the ICW Miller matches. This is the functionality that works differently than previously, but it’s actually very easy to accomplish.

Surname Search, Then ICW to Person on Match List, but not Combined

Often, you’ll find someone in the ICW Miller match list, for example, and you then want to see ALL of the ICW matches to that person, NOT just the ICW matches with Miller. Said another way, you want to utilize the name of someone found in the Miller search, but not limit the ICW results to just the Miller surname.

In this case, simply follow these steps:

Run the Miller search as in Example One.
Select Donald from the results by clicking on the box beside his name – step #2 in Example Two. Do NOT click on the ICW button, yet.
REMOVE Miller from the search box at upper right. After removing Miller, you will see the full match list load again (replacing the Miller match list), but Donald remains selected in the “Selected Matches” box in the lower left corner.
Click on the “in common with” function button to see the full ICW match list for the person selected.

Once again, you will see the full match list of 91 people between mother and Donald, as if Miller was never selected.

What Doesn’t Work

One function doesn’t work that worked previously, and that’s the ability to search for a location, meaning those locations in parenthesis in the ancestral surnames. This type of search is particularly important to people with Scandinavian ancestors whose surnames are patronymic, meaning they derive from a father’s first name, such as Johnsson for John’s son. These surnames changed generationally and locations are often more reliable in terms of genealogy searches.

This is probably a function of a feature that was being utilized by users in a way never imagined by the designers. Regardless, a bug report or enhancement request, depending in your perspective, has been submitted, but there is no known work-around today.

______________________________________________________________

Disclosure

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Services

Genealogy Research

Legacy Tree Genealogists for genealogy research

Nine Autosomal Tools at Family Tree DNA

Posted on July 21, 2016 by Roberta Estes

The introduction of the Phased Family Finder Matches has added a new way to view autosomal DNA results at Family Tree DNA and a powerful new tool to the genealogists toolbox.

The Phased Family Finder Matches are the 9^th tool provided for autosomal test results by Family Tree DNA. Did you know where were 9?

Each of the different methodologies provides us with information in a unique way to assist in our relentless search for cousins, ancestors and our quests to break down brick walls.

That’s the good news.

The not-so-good news is that sometimes options are confusing, so I’d like to review each tool for viewing autosomal match information, including:

When to use each tool
How to use each tool
What the results mean to you
The unique benefits of each tool
The cautions and things you need to know about each tool including what they are not

The tools are:

Regular Matching
ICW (In Common With)
Not ICW (Not In Common With)
The Matrix
Chromosome Browser
Phased Family Matching
Combined Advanced Matching
MyOrigins Matching
Spreadsheet Matching

You Have Options

Family Tree DNA provides their clients with options, for which I am eternally grateful. I don’t want any company deciding for me which matches are and are not important based on population phasing (as opposed to parental phasing), and then removing matches they feel are unimportant. For people who are not fully endogamous, but have endogamous lines, matches to those lines, which are valid matches, tend to get stripped away when a company employs population based phasing – and once those matches are gone, there is no recovery unless your match happens to transfer their results to either Family Tree DNA or GedMatch.

The great news is that the latest new option, Phased Family Matching, is focused on making easy visual comparisons of high quality parental matches which is especially useful for those who don’t want to dig deeply.

There are good options for everyone at all ranges of expertise, from beginners to those who like to work with spreadsheets and extract every teensy bit of information.

So let’s take a look at all of your matching options at Family Tree DNA. If you’re not taking advantage of all of them, you’re missing out. Each option is unique and offers something the other options don’t offer.

In case you’re curious, I’ll be bouncing back and forth between my kit, my mother’s kit and another family member’s kit because, based on their matches utilizing the various tools, different kits illustrate different points better.

Also, please note that you can click on any image to see a larger version.

Selecting Options

Your selection options for Family Finder are available on both your Dashboard page under the Family Finder heading, right in the middle of the page, and the dropdown myFTDNA menu, on the upper left, also under Family Finder.

Ok, let’s get started.

#1 – Regular Matching

By regular matching, I’m referring to the matches you see when you click on the “Matches” tab on your main screen under Family Finder or in the dropdown box.

Everyone uses this tool, but not everyone knows about the finer points of various options provided.

There’s a lot of information here folks. Are you systematically using this information to its full advantage?

Your matches are displayed in the highest match first order. All of the information we utilize regularly (or should) is present, including:

Relationship Range
Match Date
Shared CentiMorgans
Longest (shared) Block
X-Match
Known Relationship
Ancestral Surnames (double click to see entire list)
Notes
E-mail envelope icon
Family Tree
Parental “side” icon

The Expansion “+” at the right side of each match, shown below, shows us:

Tests Taken
mtDNA haplogroup
Y haplogroup

Clicking on your match’s profile (their picture) provides additional information, if they have provided that information:

Most distant maternal ancestor
Most distant paternal ancestor
Additional information in the “about me” field, sometimes including a website link

On the match page, you can search for matches either by their full name, first name, last name or click on the “Advanced Search” to search for ancestral surname. These search boxes can be found at the top right.

The Advanced Search feature, underneath the search boxes at right, also provides you with the option of combining search criteria, by opening two drop down boxes at the top left of the screen.

Let’s say I want to see all of my matches on the X chromosome. I make that selection and the only people displayed as matches are those whom I match on the X chromosome.

You can see that in this case, there are 280 matches. If I have any Phased Family Matches, then you will see how many X matches I have on those tabs too.

The first selection box works in combination with the second selection box.

Now, let’s say I want to sort in Longest Block Order. That section sorts and displays the people who match me on the X chromosome in Longest Block Order.

Prerequisites

Take the Family Finder test or transfer your results from either 23andMe (V3 only) or Ancestry (V1 only, currently.)
Match must be over the matching threshold of 9cM if shared cM are less than 20, or, the longest block must be at least 7.69 cM if the total shared cM is 20 or greater.

Power Features

The ability to customize your view by combining search, match and sort criteria.

Cautions

It’s easy to forget that you’re ONLY working with X matches, for example, once you sort, and not all of your matches. Note the Reset Filter button above your matches which clears all of the sort and search criteria. Always reset, just to be on the safe side, before you initiate another sort.

Please note that the search boxes and logic are in the process of being redesigned, per a conversation Michael Davila, Director of Product Development, on 7-20-2016. Currently, if you search for the name “Donald,” for example, and then do an “in common with” match to someone on the Donald match list, you’ll only see those individuals who are in common with “Donald,” meaning anyone without “Donald” as one of their names won’t show as a match. The logic will be revised shortly so that you will see everyone “in common with,” not just “Donald.” Just be aware of this today and don’t do an ICW with someone you’ve searched for in the search box until this is revised.

#2 – In Common With (ICW)

You can select anyone from your match list to see who you match in common with them.

This is an important feature because it gives me a very good clue as to who else may match me on that same genealogical line.

For example, cousin Donald is related on the paternal line. I can select Donald by clicking the box to the left of his profile which highlights his row in yellow. I can then select what I want to do with Don’s match.

You will see that Don is selected in the match selection box on the lower left, and the options for what I can do with Don are above the matches. Those options are:

Chromosome Browser
In Common With
Not in Common With

Let’s select “In Common With.”

Now, the matches displayed will ONLY be those that I match in common with Don, meaning that Donald and I both match these people.

As you can see, I’m displaying my matches in common with Don in longest block order. You can click on any of the header columns to display in reverse order.

There are a total of 82 matches in common with Don and of those, 50 are paternally assigned. We’ll talk about how parental “side” assignments happen in a minute.

Prerequisites

None

Power Features

Can see at a glance which matches warrant further inspection and may (or may not) be from a common genealogical line.

Cautions

An ICW match does NOT mean that the matching individual IS from the same common line – only genealogical research can provide that information.
An ICW matches does NOT mean that these three people, you, your match and someone who matches both of you is triangulated – meaning matching on the same segment. Only individual matching with each other provides that information.
It’s easy to forget that you’re not working with your entire match list, but a subset. You can see that Donald’s name appears in the box at the upper left, along with the function you performed (ICW) and the display order if you’ve selected any options from the second box.

# 3 – Not In Common With

Now, let’s say I want to see all of my X matches that are not in common with my mother, who is in the data base, which of course suggests that they are either on my father’s side or identical by chance. My father is not in the data base, and given that he died in 1963, there is no chance of testing him.

Keep in mind though that because X matches aren’t displayed unless you have another qualifying autosomal segment, that they are more likely to be valid matches than if they were displayed without another matching segment that qualifies as a match.

For those who don’t know, X matches have a unique inheritance pattern which can yield great clues as to which side of your tree (if you’re a male), and which ancestors on various sides of your tree X matches MUST come from (males and females both.) I wrote about this here, along with some tools to help you work with X matches.

To utilize the “Not In Common With” feature, I would select my mother and then select the “Not In Common With” option, above the matches.

I would then sort the results to see the X matches by clicking on the top of the column for X-Match – or by any other column that I wanted to see.

I have one very interesting not in common with match – and that’s with a Miller male that I would have assumed, based on the surname, was a match from my mother’s side. He’s obviously not, at least based on that X match. No assuming allowed!

Prerequisites

None

Power Features

Can see at a glance which matches warrant further inspection and may be from a common genealogical line – or are NOT in common with a particular person.

Cautions

Be sure to understand that “not in common with” means that you, the person you match and the list of people shown as a result of the “Not ICW” do not all match each other. You DO match the person on your match list, but the list of “not in common with” matches are the people who DON’T match both of you. Not in common with is the opposite of “in common with” where your match list does match you and the person you’re matching in common with.
The X and other chromosome matches may be inherited from different ancestors. Every matching segment needs to be analyzed separately.

#4 – The Matrix

Let’s say that I have a list of matches, perhaps a list of individuals that I found doing an ICW with my cousin, and I wonder if these people match each other. I can utilize the Matrix grid to see.

Going back to the ICW list with cousin Donald, let’s see if some of those people match each other on the Matrix.

Let’s pick 5 people.

I’m selecting Cheryl, Rex, Charles, Doug and Harold.

I’m making these particular selections because I know that all of these people, except Harold, are related to my mother, Barbara, shown on the bottom row of the chart above. This chart, borrowed from another article (William is not in this comparison), shows how Cheryl, Rex, Charles and Barbara who have all DNA tested are related to each other. Some are related through the Miller line, some through the dual Lentz/Miller line, and some just from the Lentz line. Doug is related through the Miller line only, and at least 4 generations upstream. Doug may also be related through multiple lines, but is not descended from the Lentz line.

The people I’ve selected for the matrix are not all related to each other, and they don’t all share one common ancestral line.

Harold is a wild card – I have no idea how he is related or who he is related to, so let’s see what we can determine.

As you make selections on the Matrix page, up to 10 selections are added to the grid.

You can see that Charles matches Cheryl and Harold.

You can see that Rex matches Charles and Cheryl and Harold.

You can see that Doug matches only Cheryl, but this isn’t surprising as the common line between Doug and the known cousins is at least 4 generations further back in time on the Miller line.

The known relationship are:

Don and Cheryl are siblings, descended from the Lentz/Miller.
Rex is a known cousin on the Miller/Lentz line
Charles is a known cousin on the Lentz line only
Doug is a known cousin on the Miller line only

Let me tell you what these matches indicate to me.

Given that Harold matches Rex and Charles and Cheryl, IF and that’s a very big IF, he descends from the same lines, then he would be related to both sides of this family, meaning both the Miller and Lentz lines.

He could be a downstream cousin after the Lentz and Miller lines married, meaning a descendant of Margaret Lentz and John David Miller, or other Miller/Lentz couples
He could be independently related to both lines upstream. They did intermarry.
He could be related to Charles or Rex through an entirely separate line that has nothing to do with Lentz or Miller.

So I have no exact answer, but this does tell me where to look. Maybe I could find additional known Lentz or Miller line descendants to add to the Matrix which would provide additional information.

Prerequisites

None

Power Features

Can see at a glance which matches match each other as well.

Cautions

Matrix matches do NOT mean that these individuals match on the same segments, it just means they do match on some segment. A matrix match is not triangulation.
Matrix matches can easily be from different lines to different ancestors. For example, Harold could match each one of three individuals that he matches on different ancestral lines that have nothing to do with their common Lentz or Miller line.

#5 – Chromosome Browser

I want to know if the 5 individuals that I selected to compare in the Matrix match me on any of the same segments.

I’m going back to my ICW list with cousin Donald.

I’ve selected my 5 individuals by clicking the box to the left of their profiles, and I’m going to select the chromosome browser.

The chromosome browser shows you where these individuals match you.

Overlapping segments mean the people who overlap all match you on that segment, but overlapping segments do NOT mean they also match each other on these same segments.

Translated, this means they could be matching you on different sides of your family or are identical by chance. Remember, you have two sides to your chromosome, a Mom’s side and a Dad’s side, which are intermingled, and some people will match you by chance. You can read more about this here.

The chromosome browser shows you THAT they match you – it doesn’t tell you HOW they match you or if they match each other.

The default view shows matches of 5cM or greater. You can select different thresholds at the top of the comparison list.

You’ll notice that all 5 of these people match me, but that only two of them match me on overlapping segments, on chromosome 3. Among those 5 people, only those who match me on the same segments have the opportunity to triangulate.

This gives you the opportunity to ask those two individuals if they also match each other on this same chromosome. In this case, I have access to both of those kits, and I can tell you that they do match each other on those segments, so they do triangulate mathematically. Since I know the common ancestor between myself, Cheryl and Rex, I can assign this segment to John David Miller and Margaret Lentz. That, of course, is the goal of autosomal matching – to identify the common ancestor of the individuals who match.

You also have the option to download the results of this chromosome browser match into a spreadsheet. That’s the left-most download option at the top of the chromosomes. We’ll talk about how to utilize spreadsheets last.

The middle option, “view in a table” shows you these results, one pair of individuals at a time, in a table.

This is me compared to Rex. You will have a separate table for each one of the individuals as compared to you. You switch between them at the bottom right.

The last download option at the furthest right is for your entire list of matches and where they match you on your chromosomes.

Prerequisites

None

Power Features

Can visually see where individuals and multiple people match you on your chromosomes, and where they overlap which suggests they may triangulate.

Cautions

When two people match you on the same chromosome segment, this does not mean that they also match each other on that segment. Matching on overlapping segments is not triangulation, although it’s the first step to triangulation.
For triangulation, you will need to contact your matches to determine if they also match each other on the same segment where they both match you. You may also be able to deduce some family matching based on other known individuals from the same line that you also match on that same segment, if your match matches them on that segment too.
The chromosome browser is limited to 5 people at a time, compared to you. By utilizing spreadsheet matching, you can see all of your matches on a particular segment, together.

#6 – Phased Family Matching

Phased Family Matching is the newest tool introduced by Family Tree DNA. I wrote about it here. The icons assigned to matches make it easy to see at a glance which side of your family, maternal or paternal, or both, a match derives from.

Phased Family Matching allows you to link the DNA results of qualified relatives to your tree and by doing so, Family Tree DNA assigns matches to maternal or paternal buckets, or sometimes, both, as shown in the icon above.

This phased matching utilizes both parental phasing in addition to a slightly higher threshold to assure that the matches they assign to parental sides can be done so with confidence. In order to be assigned a maternal or paternal icon, your match must match you and your qualifying relative at 9cM or greater on at least one of the same segments over the matching threshold. This is different than an ICW match, which only tells you that you do match, not how you match or that it’s on the same segment.

Qualifying relatives, at this time, are parents, grandparents, uncles, aunts and first cousins. Additional relatives are planned in the near future.

Icons are ONLY placed based on phased match results that meet the criteria.

These icons are important because they indicate which side of your family a match is from with a great deal of precision and confidence – beyond that of regular matching.

This is best illustrated by an example.

In this example, this individual has their father and mother both in the system. You can see that their father’s side is assigned a blue icon and their mother’s side is assigned a pink (red) icon. This means they match this person on only one side of their family. A purple icon with both a male and female image means that this person is related to you on both sides of your family. Full siblings, when both parents are in the system to phase against, would receive both icons.

This sibling is showing as matching them on both sides of their family, because both parents are available for phasing.

If only one parent was available, the father, for example, then the sibling would only shows the paternal icon. The maternal icon is NOT added by inference. In Phased Family Matching, nothing is added by inference – only by exact allele by allele matching on the same segment – which is the definition of parentally phased matching.

These icons are ONLY added as a result of a high quality phased matches at or above the phased match threshold of 9cM.

You can read more about the Family Matching System in the Family Tree DNA Learning Center, here.

Prerequisites

You must have tested (or transferred a kit) for a qualifying relative. At this time qualifying relatives parents, grandparents, aunts, uncles and first cousins.
You must have uploaded a GEDCOM file or created a tree.
You must link the DNA of qualifying kits to that person your tree. I provided instructions for how to do this in this article.
You must match at the normal matching threshold to be on the match list, AND then match at or above the Phased Family Match threshold in the way described to be assigned an icon.
You must match on at least one full segment at or above 9cM.

Power Features

Can visually see which side of your family an individual is related to. You can be confident this match is by descent because they are phased to your parent or qualifying family member.

Cautions

If someone does not have an icon assigned, it does NOT mean they are not related on that particular side of the family. It only means that the match is not strong enough to generate an icon.
If someone DOES match on a particular side of the family, you will still need to do additional matching and genealogy work to determine which ancestor they descend from.
If someone is assigned to one side of your family, it does NOT preclude the possibility that they have a smaller or weaker match to your other side of the family.
If you upload a new Gedcom file after linking DNA to people in your tree, you will overwrite your DNA links and will have to relink individuals.
Having an icon assigned indicates mathematical triangulation for the person who tested, their parents or close relative against whom they were phased and their match with the icon. However, technically, it’s not triangulation in cases where very close relatives are involved. For example, parents, aunts, uncles and siblings are too closely related to be considered the third leg of the triangulation stool. First cousins, however, in my opinion, could be considered the third leg of the three needed for triangulation. Of course when triangulation is involved, more than three is always better – the more the merrier and the more certain you can be that you have identified the correct ancestor, ancestral couple, or ancestral line to assign that particular triangulated segment to.

# 7 – Combined Advanced Matching

One of the comparison tools often missed by people is Combined Advanced Matching.

Combined matching is available through the “Tools and Apps” button, then select “Advanced Matching.”

Advanced Matching allows you to select various options in combination with each other.

For example, one of my favorites is to compare people within a project.

You can do this a number of ways.

In the case of my mother, I’ll select everyone she matches on the Family Finder test in the Miller-Brethren project. This is a very focused project with the goal of sorting the Miller families who were of the Brethren faith.

You can see that she has several matches in that project.

You can select a variety of combinations, including any level of Y or mtDNA testing, Family Finder, X matching, projects and “last name begins with.”

One of the ways I utilize this feature often is within a surname project, for males in particular, I select one Y level of matching at a time, combined with Family Finder, “show only people I match on all tests” and then the project name. This is a quick way to determine whether someone matches someone on Family Finder that is also in a particular surname project. And when your surname is Smith, this tool is extremely valuable. This provides a least a hint as to the possible distance to a common ancestor between individuals.

Another favorite way to utilize this feature is for non-surname projects like the American Indian project. This is perfect for people who are hunting for others with Native roots that they match – and you can see their Y and mtDNA haplogroups as a bonus!

Prerequisites

Must have joined the particular project if you want to use the project match feature within that project.

Power Features

The ability to combine matching criteria across products.
The ability to match within projects.
The ability to specify partial surnames.

Cautions

If you match someone on both Family Finder and either Y or mtDNA haplogroups, this does NOT mean that your common Family Finder ancestor is on that haplogroup line. It might be a good place to begin looking. Check to see if you match on the Y or mtDNA products as well.
All matches have their haplogroup displayed, not just IF you also match that haplogroup, unless you’ve specified the Y or mtDNA options and then you would only see the people you match which would be in the same major haplogroup, although not always the same subgroup because not everyone tests at the same level.
Not all surname project administrators allow people who do not carry that surname in the present generation to join their projects.

# 8 – MyOrigins Matching

One tool missed by many is the MyOrigins matching by ethnicity. For many, especially if you have all European, for example, this tool isn’t terribly useful, but if you are of mixed heritage, this tool can be a wonderful source of information.

Your matches (who have authorized this type of matching) will be displayed, showing only if they match you on your major world categories. Only your matching categories will show. For example, if my match, Frances, also has African heritage and I do not, I won’t see Frances’s African percentage and vice versa.

In this example, the person who tested falls into the major categories of European and Middle Eastern. Their matches who fall into either of these same categories will be displayed in the Shared Origins box. You may not be terribly excited about this – unless you are mixed African, Asian, European and Native American – and you have “lost ancestors” you can’t find. In that case, you may be very excited to contact other matches with the same ethnic heritage.

When you first open your myOrigins page, you will be greeted with a choice to opt in (by clicking) or to opt out (by doing nothing) of allowing your ethnic matches to view the same ethnic groups you carry. Your matches will not be able to see your ethnic groups that they don’t have in common with you.

You can also access those options to view or change by clicking on Account Settings, Privacy and Sharing, and then you can view or change your selection under “My DNA Results.”

Prerequisites

Must authorize Shared Origins matching.

Power Features

The ability to discern who among your matches shares a particular ethnicity, and to what degree.

Cautions

Just because you share a particular ethnicity does NOT mean you match on the shared ethnic line. Your common ancestor with that person may be on an entirely unrelated line.

# 9 – Spreadsheet Matching

Family Tree DNA offers you the ability to download your entire list of matches, including the specific segments where your matches match you, to a spreadsheet.

This is the granddaddy of the tools and it’s a tool used by all serious genetic genealogists. It’s requires the most investment from you both in terms of understanding and work, but it also yields the most information.

The power of spreadsheet comparisons isn’t in the 5 people I pushed through to the chromosome browser, in and of themselves, but in the power of looking at the locations where all of your matches match you and known relatives on particular segments.

Utilizing the chromosome browser, we saw that chromosome 3 had an overlap match between Rex (green) and Cheryl (blue) as compared to my mother (background chromosome.)

We see that same overlap between Cheryl and Rex when we download the match spreadsheet for those 5 people.

However, when we download all of my mother’s matches, we have a much more powerful view of that segment, below. The 2 segments we saw overlapping on the chromosome browser are shown in green. All of these people colored pink match my mother on some part of the 37cM segment she shares with Rex.

This small part of my master spreadsheet combines my own results, rows in white, with those of my mother, rows in pink.

In this case, I only match one of these individuals that mother also matches on the same segment – Rex. That’s fine. It just means that I didn’t receive the rest of that DNA from mother – meaning the portions of the segments that match Sam, Cheryl, Don, Christina and Sharon.

On the first two rows, I did receive part of that DNA from mother, 7.64 of the 37cMs that Rex matches to Mom at a threshold of 5cM.

We know that Cheryl, Don and Rex all share a common ancestor on mother’s father’s side three generations removed – meaning John David Miller and Margaret Lentz. By looking at Cheryl, Don and Rex’s matches as well, I know that several of her matches do triangulate with Cheryl, Don and/or Rex.

What I didn’t know was how Christina fit into the picture. She is a new match. Before the new Phased Family Matching, I would have had to go into each account, those of Rex, Cheryl and Don, all of which I manage, to be sure that Christina matched all of them individually in addition to Mom’s kit.

I don’t have to do that now, because I can utilize the phased Family Matching instead. The addition of the Family Matching tool has taken this from three additional steps, assuming I have access to all kits, which most people don’t, to one quick definitive step.

Cheryl and Don are both mother’s first cousins, so matches can be phased against them. I have linked both of them to mother’s kit so she how has several individuals who are phased to Don and Cheryl which generate paternal icons since Don and Cheryl are related to mother on her father’s side.

Now, instead of looking at all of the accounts individually, my first step is to see if Christina has a paternal icon, which, in this case, means she phased against either Don and/or Cheryl since those are the only two people linked to mother who qualify for phasing, today.

Look, Christina does have a paternal icon, so I can add “Dad” into the side column for Christine in the spreadsheet for mother’s matches AND I know Christina triangulates to Mom and either Cheryl or Don, which ever cousin she phased against.

I can see which cousin she phased against by looking at the chromosome browser and comparing mother against Cheryl, Don and Christina. As it turns out, Christina, in green, above, phased against both Cheryl and Don whose results are in orange and blue.

It’s a great day in the neighborhood to be able to use these tools together.

Prerequisites

Must download matches spreadsheet through the chromosome browser, adding new matches to your spreadsheet as they occur.
Must have a familiarity with Excel or another spreadsheet.
Must learn about matching, match groups and triangulation.

Power Features

The ability to control the threshold you wish to work with. For matches over the match threshold, Family Tree DNA provides all segment matches to 1cM with a total of 500 SNPs.
The ability to see trends and groups together.
The ability to view kits from all of your matches for more powerful matching.
The ability to combine your results with those of a parent (or sibling if parents not available) to see joint matching where it occurs.

Cautions

There is a comparatively steep learning curve if you’re not familiar with using spreadsheets, but it’s well worth the effort if you are serious about proving ancestors through triangulation.

Summary

I’m extremely grateful for the full complement of tools available at Family Tree DNA.

They provide a range of solutions for users at all levels – people who just want to view their ethnicity or to utilize matches at the vendor site as well as those who want tools like a chromosome browser, projects, ICW, not ICW, the Matrix, ethnicity matching, combined advanced matching and chromosome browser downloads for those of us who want actual irrefutable proof. No one has to use the more advanced tools, but they are there for those of us who want to utilize them.

I’m sorry, I’m not from Missouri, but I still want to see it for myself. I don’t want any vendor taking the “trust me” approach or doing me any favors by stripping out my data. I’m glad that Family Tree DNA gives us multiple options and doesn’t make one size fit all by using a large hammer and chisel.

The easier, more flexible and informative Family Tree DNA makes the tools, the easier it will be to convince people to test or download their data from other vendors. The more testers, the better our opportunity to find those elusive matches and through them, ancestors.

The Concepts Series

I’ve been writing a “Concepts” series of articles. Recent articles have been about how to utilize and work with autosomal matches on a spreadsheet.

You might want to read these Concepts articles if you’re serious about working with autosomal DNA.

Concepts – How Your Autosomal DNA Identifies Your Ancestors

Concepts – Identical by…Descent, State, Population and Chance

Concepts – CentiMorgans, SNPs and Pickin’ Crab

Concepts – Parental Phasing

Concepts – Downloading Autosomal Data from Family Tree DNA

Concepts – Managing Autosomal DNA Matches – Step 1 – Assigning Parental Sides

Please join me shortly for the next Concepts article – Step 2 – Who’s Related to Whom?

In the meantime:

Make full use of the autosomal tools available at Family Tree DNA.
Test additional relatives meaning parents, grandparents, aunts, uncles, half-siblings, siblings, any cousin you can identify and talk into testing.
Take test kits to family reunions and holiday gatherings. No, I’m not kidding.
Don’t forget Y or mtDNA which can provide valuable tools to identify which line you might have in common, or to quickly eliminate some lines that you don’t have in common. Some cousins will carry valuable Y or mtDNA of your direct ancestral lines – and that DNA is full of valuable and unique information as well.
Link the DNA kits of those individuals you know to their place in your tree.
Transfer family kits from other vendors.

The more relatives you can identify and link in the system, the better your chances for meaningful matches, confirming ancestral relations, and solving puzzles.

Have fun!!!

______________________________________________________________

Disclosure

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Services

Genealogy Research

Legacy Tree Genealogists for genealogy research

Concepts – Parental Phasing

Posted on April 6, 2016 by Roberta Estes

I recently used a technique called parental phasing as part of the proof that one Curtis Lore found in Pennsylvania was the same person as Curtis Benjamin Lore, found later in Indiana. Given that I’ve already used parental phasing as part of a proof argument, I’d like to break it down further and explain the concepts behind parental phasing, what it is, why it is so important, and why it works so well.

For those of you who don’t have at least one parent available to test, I’m truly sorry, and not just because of the lost DNA opportunity. But please do read this article, because you may be able to substitute other family members and derive at least some of the benefits, although clearly not all.

What is Parental Phasing?

The fundamental concept of parental phasing is that the only way you can obtain your DNA is through one or the other of your parents, so every one of your matches should match you plus one of your parents. Right?

Should, yes, but that’s not exactly how autosomal matching works in real life.

You can match someone in one of two ways:

Because you received the matching segment from one of your two parents, and they received that same segment from one of their two parents, a circumstance that is called identical by descent or IBD.
Because your match’s DNA is zigzagging back and forth between the DNA you inherited from both of your parents, or your DNA is zigzagging back and forth between their parents, either of which is called identical by chance or IBC.

I wrote about his in the article titled, Concepts – Identical by…Descent, State, Population and Chance.

Here’s the matching “Identical By” cheat sheet since you may find it helpful in this article as well.

How Does Parental Phasing Work?

Parental phasing works by comparing your DNA against your matches DNA, then comparing your matches DNA against your parents DNA, and telling you which, if either, or both, parents they match in addition to you. Oh yes, and there’s one more tiny tidbit – they must match you and your parent(s) on the same segment(s).

As bizarre as it sounds, sometimes your match will match you on one segment, and match your parents on an entirely different segment. While this was not an expected finding, it does happen, and frequently enough that it was found in every parental phasing test run – so it’s not an anomaly or something so rare you won’t see it.

Therefore, parental phasing may be a two part process, where:

Step 1 is determining whether or not your match matches either or both of your parents.
Step 2 is determining if your match matches you and your parent on the same segment(s), or at least part of the same segment? If not, then it’s not a phased IBD match – even though they do match you and your parent.

Conceptually, each of your matches will fall nice and cleanly into one, or both, of your parent’s buckets. Let’s look at a couple of examples. For each of the people who match you, they will also match your parents on the same segment as follows:

Match	Matches Your Mother	Matches Your Father	Matches Neither Parent	Comment
Susie	Yes	No		From Mom’s side, IBD
John	No	Yes		From Dad’s side, IBD
Bob	Yes	Yes		Matches both parents lines, IBD and may be IBP
Roxanne	No	No	Yes	Identical by Chance, IBC

Please Note: Your match list will change if you change your matching threshold, and so will your phased matches to your parents. In other words, while someone might not match you and a parent both on the same segment at 15cM, you might well match on a common segment at a 10, 7 or 5cM threshold.

So in essence, parental phasing puts your matches into very useful buckets for you and helps eliminate false positives – or matches that appear real but aren’t.

How Can Someone Match Me But Not My Parents?

That’s a really good question. Sometimes you match someone because you received common DNA from an ancestor, through your parents, which means you’re identical by descent (IBD), a legitimate genealogical match. But other times, you match someone just by chance because their DNA is matching pieces of both of your parents’ DNA, and not because you actually share a common ancestor.

Let’s take a look.

This first graphic shows you with an identical by descent match to your match’s father’s DNA. Your match’s father shares a common relative with (at least) one of your mother’s lines.

In the most basic terms, an identical by descend (IBD) match looks like this, where your match is matching you on one of your parent’s strands of DNA. Both matching strands are colored green in this example.

Of course, your DNA does not come labeled as to which side is mother’s and which side is father’s. You can read more about that here. If it did, we wouldn’t even need to be having this discussion at all – because that’s what parental phasing does. It tells you which side of your family your DNA match came from.

You can see in the above example that you and your match both share an actual strand of DNA. You inherited yours from your Mom and your match inherited theirs from their Dad, which means your Mom and their Dad share a common ancestor. However, to be able to discern that fact, that your Mom and your match’s Dad share a common ancestor, you need to be able to phase the DNA of both you and your match to know which parent that strand came from.

In reality, your DNA and their DNA is entirely mixed in each of you, shown in the chart below, and without additional information, neither of you will know which strand of DNA you match on, or who you inherited it from. Initially, you will only know THAT you match.

So here’s what your DNA really looks like. It’s up to the DNA matching software to look at the two strands of your DNA that’s mixed together, and the two strands of your match’s DNA that’s mixed together and see if there is a common grouping of DNA at each location that extends for at least 10 locations in length, which is the “threshold” for our example that signifies a match that is likely to be “real” versus IBC, or identical by chance. In my example, that common grouping is the green “Matching Portions” column, above.

An identical by chance match looks like the chart below. You can see that the green matching DNA is zigzagging back and forth between your parents’ DNA.

It can even be worse where your match’s Mom’s and Dad’s DNA is also zigzagging back and forth, but you can certainly get the idea that there are all kinds of ways to NOT match but only three ways to legitimately match – Mom’s side, Dad’s side, or both.

So you can see that indeed, you do technically match, but not because you share a DNA segment of any size with one parent, but because your match’s DNA matches part of your Mom’s DNA and part of your Dad’s, which means that DNA segment does NOT come from one common ancestor, meaning not IBD. However, the matching software can’t tell the difference, because your strands aren’t coded to Mom and Dad.

What parental phasing does is to assign your matches to “sides” or buckets based on whether they match your Mom or Dad in addition to you.

One Parent Matches

In my case, I only have one parent whose DNA is available. Therefore, all of my matches will either match both my mother and me, or not. The balance that do not match me and my mother, both, will either match to my father or will be IBC, identical by chance matches. Unfortunately, just by utilizing one-parent phasing, I can’t tell if the “non-Mom” matches are really to my father or are IBC.

Let’s look at an example.

Match	Mom’s Side	Dad or IBC	Comment
Denny	Yes	Probably not	Mom’s side, could also match on Dad’s side but we have no way to tell. My parents lines come from different parts of the world except that they both married into Native American lines.
Sally	No	Yes	Can’t tell whether Dad’s side or IBC
Derrell	No	Yes	Also matches cousin on Dad’s side on same segments, so Derrell is assigned to Dad’s side pending triangulation.

By using the ICW tool at Family Tree DNA, shown below, I can see who matches me and my matches, both – in this case, me and my mother.

No Parent Matches

If I have no parents in the system, but several other close family members, like uncles or cousins, I can easily see who else I match in common with my match.

In other words, without my mother to match, Denny will either match my Mom’s side family members, and I can tentatively group him there, my Dad’s side family members, and I can tentatively group him there, or neither, in which case I can’t do anything with him except note that fact.

An Example

I’m going to use my proven cousin Denny for my examples, because that’s who I used in my Curtis Lore case study and our connection is proven both genetically and genealogically.

Here’s Denny’s match list. My mother is Denny’s closest match and I’m his second closest.

Therefore, I can use the ICW technique to effectively put my matches into buckets that divide my DNA in half, if I have both parents.

If I have one parent, I can fill one bucket for sure by putting everyone who matches both my mother and me into the “mother” bucket. The balance will be in the “Father +IBC” bucket.

This is easy to do at Family Tree DNA by using the crossed arrow ICW tool to find everyone who matches me in common with my mother.

If I don’t have either parent, but I have an uncle or a cousin, I can still assign some matches to buckets by utilizing this same ICW tool. What I can’t do without both parents is to eliminate IBC or identical by chance matches from my match list. I need both parents or at least well fleshed out match groups to do that. There are examples of using match groups to identify IBC matches in the article, Identical By…Descent, Chance, Population and State.

Furthermore, I will need to download my match lists for both my mother and myself to verify that each person matches both my mother and myself on a common segment.

Testing the Theory

Let’s use my real life example and see how this works. I’m going to utilize three generations, because this gives us the ability to see the parental phasing work twice. In this illustration, below, four people have tested, Denny, Mother, Me and My Child.

Denny and my child, who are 3^rd cousins once removed, match on the following DNA segments, utilizing the Family Tree DNA chromosome browser. We are comparing against Denny, meaning he is the “background” black chromosome. The orange illustrates where my child matches Denny.

There are no matching segments on chromosomes 18-22. I have not included X chromosome matching.

Here’s the same information in chart format.

You can see that Denny and my child have several fairly significant segment matches, along with some smaller ones too. The question is, which of those segments are legitimate, meaning IBD and which are not, meaning IBC?

Let’s phase my child against my DNA and see which of these segment matches hold up.

My child is orange, and I am blue and we are both matching against cousin Denny.

As you can see, many of those segments are legitimate because Denny matches both me and my child on the same segments. So they are not IBC, or identical by chance, but IBD, identical, literally, by descent – because my child received them from me.

In some cases, Denny matches only me, blue, which is fine because all that means is that either our matches are IBC or I didn’t pass that DNA to my child. Both matches on chromosome 3 are to me (blue) and not to my child (orange).

However, in the cases where Denny matches my child (orange,) and not me (blue,) on the same segments, that means that either Denny and my child share an ancestor that is through my child’s father or the matches are IBC. Those matches are not through me. In other words, those segments did not pass phasing. You can see examples of that on chromosomes 1, 4 and 14, and partial matches on 11 and 12.

Chromosome 16 shows a really good example of a crossover event where my child, orange, received part of my DNA, blue, but about half way through my segment, it was divided and my child inherited part of mine and the other half from their father. So, visually, you can see that my child only matches Denny on about half of the segment where I match Denny.

Matches Spreadsheet

I downloaded the results of both Denny’s matches to me and Denny’s matches to my child into one Matches Spreadsheet and have color coded them so that you can see the relationships. If Denny matches both me and my child, you will see a common segment on that chromosome for both me and my child in the spreadsheet. Rows where Denny matches my child are light orange and rows where Denny matches me are light blue, similar to the chromosome browser colors.

There are only three possible conditions and I have colored the chromosome column accordingly:

Denny matches me only – dark teal – may be a legitimate match but we don’t have enough information to tell at this point
Denny matches my child only, but not me – red – NOT a legitimate match – identical by chance (IBC)
Denny matches me and my child both – boxed green – a legitimate identical by descent (IBD) match

You’ll note that some of these matches are exact. For example on the first matching segment of chromosome 2, below, my child received this entire segment of my DNA. It was not divided at all.

However, in the next two matching groups on chromosome 2, my child received most of the DNA I share with Denny, but some was shaved off, but not half.

On chromosome 16, my child received almost exactly half of the DNA segment that I share with Denny.

On chromosomes 11 and 17, my child shares more DNA with Denny than I do, which means that all of that DNA isn’t ancestral though me. In this case, either there are some fuzzy boundaries, a read error, part of the DNA is IBD and part is IBC or part of the DNA is matching through both parents.

On chromosome 14, I match Denny, but my child received none of that DNA, which is why I’ve added the color teal.

Now, let’s phase me against my mother and see how the DNA matches hold up in a third generation.

Adding the Next Generation

The view of the chromosome browser below shows Denny matching my child, in orange, me in blue and my mother in green.

Amazingly, many of these segments follow through all three generations.

Let’s see how the various matches stacked up, pardon the pun.

I’ve added Denny’s matches to mother to the Matches Spreadsheet and her rows are colored green.

On the Matches Spreadsheet from the first example, there were several segments where Denny matched only me and not my child. They were colored teal. In the chart below, so we can track those segments, I have colored them teal in the matchname column, and you can see the resolution of how they did or didn’t survive phasing against my mother in the chromosome column.

Of those 11 segments, 2 phased with my mother, the rest did not. That makes sense, since none of those are segments I passed on to my child, so they would be more likely to be IBC.

The legend for the spreadsheet above is as follows:

Dark teal in chromosome column – Denny matches Mom only – may be a legitimate match but we don’t have enough information to know (chromosomes 1, 2, 4, 5, 6, 7, 9, 12 and 15)
Dark teal in matchname column, plus red in chromosome column – previously Denny matched only me, now I do not phase against my mother, so this is an IBC match (chromosomes 1, 3, 4, 5, 6, 7, 10, 12 and 17)
Dark teal in matchname column, plus green box in chromosome column – previously Denny only matched me, but now this segment is parentally phased and considered legitimate (chromosomes 2 and 10)
Red in chromosome column – does not phase against parent, so not a legitimate match – IBC (chromosomes 1, 3, 4, 5, 6, 7, 10, 11, 12, 14 and 17)
Green box indicates a phased match – considered IBD and legitimate (chromosomes 1, 2, 10, 14, 15, 16 and 17)

Anomalies

*So what the heck happened with chromosome 11?

In the first example, this segment received a green box because Denny matched both me and my child on a partial segment, which means that partial segment is phased and considered legitimate.

When we moved to the next generation, phasing against my mother, Denny does not match my mother on this segment, so it could NOT have arrived in me and my child via my mother, so it is not IBD, even though it appeared that way initially. Because of this, I’ve changed the box color to red for a non-IBD match.

How could this happen?

First, it’s a very small segment overlap match, and second, Denny matched more to my child than to me, which is a neon warning sign that this segment match is suspect, especially those two conditions in combination with each other.

Here’s an example of how, genetically, a match could phase with a parent in one generation, but not hold into the next generation.

This match matches both me and my child (gold), but not my mother, who has no gold. As you can see, the match does accrue 10 gold location matches in a row, but not 10 green ones, so doesn’t match my mother. The larger the number of locations in a row required to be considered a match, the less likely this type of random matching will be to occur.

This is both the purpose and the quandry of thresholds. Finding that sweet spot that doesn’t eliminate real matches, but is high enough to be useful in eliminating false positive (IBC) matches. And I can tell you, there are just about as many opinions on what that threshold number should be as there are people giving opinions – and everyone seems to have one! You can read more about this in the article, Concepts – CentiMorgans, SNPs and Pickin’ Crab.

Segment Survival

Let’s take a look and see how many of which size segments survived parental phasing. Are some of those smaller segments legitimate matches, or did we lose them in phasing?

The chart below shows the results in segment size order, color coded as follows:

Red = segments that did not phase and were IBC
Teal = segments that match Mom only and may or may not be valid. We don’t have any way to know without additional matches.
Green = segments that phased and are IBD

As you would expect, all of the larger segments phased, but surprisingly, so did several of the smaller segments, through three generations.

Given the fact that teal matches did not phase, for the most part, in the previous example, and given that the teal segments are mostly small, my suspicion would be that most of these teal segments would not phase (with the probable exception of the 10.27 cm segment), if we have the opportunity to find out – which we don’t.

This example is for a non-endogamous line, or better stated, with distant endogamous groups in multiple lines. Endogamous results would probably be different.

Statistics

What do our statistics look like?

There were 58 matching segments between Denny, my child, me and my mother.

	Match To Whom	# Segments	# Phased	%
Denny	My Child	12	8	75
Denny	Me	22	11	50
Denny	Mother	24	Probably at least 11
Total		58

Of those 58 total matches, 16 were IBC meaning they did not match up through my mother.

Total

Segment Matches

IBC (no phase)

IBD (phase)

Just Mother

Match Groups

2 gen Groups

3 gen Groups

28%

50%

22%

25%

75%

Thirteen match just to mother (teal), of which one, on chromosome 12 for 10.27 centiMorgans, is the most likely to be legitimate, or IBD. The rest were smaller segments and none were passed to a the child, so they are less likely to be legitimate, or IBD.

There are a total of 12 matching groups, of which 3 are for only two generations, me and mother. In other words, not all of that DNA got passed on to my child, but at least some of it did 9 of those 12 times.

Does Size Matter?

I wanted to see how the small versus large segments faired in terms of three generations of parental phasing. Are smeller segments legitimate or not? Do they stand up? The “Phased cMs by Size” chart above was sorted in chromosome order, with teal being a match to mother only (so we don’t know if it phased), green meaning the segment DID phase and red meaning it DID NOT phase with the parent.

Removing the teal blocks, which match to mother only, meaning we don’t know if they would parentally phase or not, leaves us with the blocks that had the opportunity to phase, and whether they passed or failed. 100% of the blocks 3.57cM and above phased. A natural dividing line seems to occur about the 3.5 cM level, shown below.

It’s interesting that all matches above 3.36 cM phased, several of them twice, through three generations or two transmission (inheritance) events. Of those, 9, or 43% were under the 10cM threshold suggested by some, and 7, or 33% were under the 7cM threshold.

Most of the segments 3.36 cM and below, did not pass phasing. Of those, 6 or 26% did pass phasing, while 17, or 74%, did not. Note that this cM level is with the SNP threshold set to 500 SNPs, which is generally the lowest number I use.

Segment Size	# of Segments	# Segments Phased	%
Larger than 3.5 cM	21	21	100
Smaller than 3.5 cM	23	6	26

Are these results a function of this particular family, or would this hold if more parental generational phasing studies were performed?

Let’s see.

The Threshold Study

I was surprised by the seemingly low threshold of 3.5 cM that appeared to be the rough dividing line for cMs that passed parental phasing and those that did not. I undertook a small study of four additional 3 generation non-endogamous families.

I’ve included the Lore study that we discussed above in the first column.

I have also removed all duplicates in the results below, since the duplicates were an artifact of matching groups where we had three generations to match.

I completed 4 different three-generation studies in 4 unrelated non-endogamous families and noted the rough threshold for where matches seem to pass or fail phasing – in other words, the fall line. In all 4 examples below, the threshold was between 2.46 and 3.16 cM. You could move it slightly higher, depending on what criteria you use for the “fall line,” which is why I’ve included the raw data. In all cases, the SNP threshold was at 500 so you would not see any matches with fewer than 500 SNPs.

The black bar in the results below marks the location where the shift from fail to pass occurs in the various studies.

Additionally, I have one 4-generation study available as well. The closest related of the 4 generations that were being matched against were first cousins, then first cousins once removed, then first cousins twice removed (equal to 2^nd cousins) then 1^st cousins three times removed (equal to second cousins once removed).

You can see, below, that the pass/fail threshold for this 4 generation, 3 transmission study was also at 3.69 cM for valid segments that survived. The segments labeled “2 match” mean that they did not get passed to the younger generations, so they only matched in the oldest two generations, 3 match the oldest 3 generations and 4 match meaning the match survived through all 4 generations.

It’s interesting that even some of the smaller segments held through all 4 generations.

Ethnicity Matters

Clearly, parental phasing is only successful when you have matches. Of the three data bases available for autosomal DNA comparisons today, Family Tree DNA and 23andMe likely have the largest representation of non-US participants, because the Ancestry.com test was not sold outside the US for quite some time. The Family Tree DNA Family Finder test was sold in the most locations outside the US.

Family Tree DNA probably has the best representation of Jewish DNA of all of the data bases.

Family Tree DNA projects facilitate the grouping of individuals by self-selected interest which includes ethnic categories, making those relationships visible by virtue of project membership wherein they are not readily evident in other data bases.

Therefore, by virtue of who has tested, if your ancestry is not “US” meaning a melting pot type of environment who are not recent arrivals, then you are likely to have less matches, so less phased matches too. If you have a high degree of any particular ethnicity, even if your ancestry is “US,” you may still have fewer matches. For example, 3 of 4 of my mother’s grandparents were either German or Dutch, and she has 710 matches, or roughly half the matches that I have. My father’s heritage was Appalachian, meaning Colonial American.

Here’s a quick chart showing the total matches as of April, 2016 for a number of individuals who contributed their match totals in Family Finder and who carry either no US heritage or a specific ethnicity. For purposes of comparison, three individuals with typical mixed colonial US heritage are shown at the top.

People with high percentages of African heritage tend to have few matches today, as do those of purely European heritage. Unfortunately, not many Africans or African-Americans test their DNA and DNA testing is not as popular in Europe as it is in the US. Many people in Europe are leary of DNA testing or don’t feel they need to test, because “we’ve always lived here.” I’m hopeful that the sustained popularity of programs like Who Do You Think You Are and Finding Your Roots will encourage more people of all ethnicities and locations to test from around the globe.

People from highly endogamous populations have a different issue to deal with, as you can see from the very high number of Jewish matches in the chart above. Since these people descend from a common founder population, they share a lot of ancestral DNA that is identical by population, meaning they did receive it from an ancestor, so it’s not IBC, but they received that segment because that particular segment is very prevalent within that population. Determining which ancestor contributed that piece of DNA is exceedingly difficult, if not impossible because several ancestors carried that same segment.

Therefore, while the segment is identical by descent, it’s probably not genealogically useful in a 100% endogamous scenario.

In an unpublished study, we discovered that while working with parentally phased Jewish results, it’s not unusual for up to half of the matches to not match the participant plus either parent on the same segments. Or conversely, they may match both parents, but the segments are comparatively small. Matching to both parents in an endogamous population, without a known familial relationship, and without at least one relatively large segment, is an indicator of IBP, identical by population, matches. For Jewish and other endogamous people, parental phasing is very promising, and will help them sort through irrelevant “diamond in the rough” matches indicated by no parent matches or smaller both parent matches to find the genealogically relevant gems.

In all parental phasing groups studied, no one lost less than 10% of their matches utilizing parental phasing and most people lost significantly more, up to half. I would very much like to see these same kinds of 3 or 4 generation parental phasing studies done for groups of Jewish, other endogamous and African American families. In order to do a study of one family, you need at least 3 generations who have tested and another known family member, like a first or second cousin perhaps, to match against.

In Summary

Dual parental phasing works wonderfully. One parent phasing works pretty well too. Even close relative phasing works, just not as well as parental phasing. You can only work with the people you have available to test, so test every relative you can convince!

If you have one or both parents to test, by all means, do. You’ll be able to phase your matches against both of your parents individually and eliminate the majority of IBC matches.

If you have grandparents or their siblings available to test, do, and quickly so you don’t lose the opportunity. Test the oldest person/generation in each line that you can.

If you don’t have both parents, test your half and full siblings, all of them, the more the better, because they inherited parts of your parents DNA that you didn’t.

Find your closest relatives and test them, yes, all of them.

If you are testing parents, you don’t need to test their children too, because their children will only receive half of their parent’s DNA, and you already have the parents DNA.

Even if you can’t phase your matches utilizing your parents DNA, you can use the combination of your matches with other relatively close family members to assign or suggest matches to both sides of your family along family lines – creating match groups. For example, if your match matches you and your great-uncle Charlie on the same segment, then it’s very likely that match is from the common ancestral line shared by your common ancestor with great-uncle Charlie – your great-grandparents. Triangulation, of course, will prove that.

Some of your relatives will be quite interested in DNA testing and others will be happy to test simply because it helps you, and they like to hear about the result of the genealogy research. I’ve discovered that providing a scholarship for the testing, especially for those people you really want to test, goes a very long way in convincing people that DNA testing for genealogy is something they might be interested in doing. If you can’t personally afford a scholarship for everyone, try the old fashioned collection jar. And no, I’m not kidding. It works wonders and gives everyone an opportunity to participate and invest as well, as much as they can afford.

Ethnicity testing has a lot of sizzle for some folks too – so don’t just deliver the dry facts – be sure to talk about the sizzle too. Sizzle sells! People get excited about the possibilities and of course, you’ll explain the result to them, so they get to visit with you a second time as well. Something to look forward to at next summer’s picnic!

Be sure to take swab kits to family events; picnics, reunions, graduation parties, weddings and holiday gatherings. Believe me, I have a DNA kit in my purse or car at all times. And maybe, if your extended family lives close by, resurrect the old-time Sunday afternoon tradition of “going calling.” Not only can you collect DNA, you can collect family memories too and I guarantee, you’ll make a new discovery with every visit. Take this opportunity to interview your relatives.

It’s amazing isn’t it, the things we do for this “DNA phase” that we’re all going through!

Acknowledgements

I want to thank Family Tree DNA for their ongoing support of projects and citizen scientists which makes these types of research studies possible. I also want to thank several individuals in the genetic genealogy community who provided their information and gave permission for me to incorporate their results into this article. Without sharing and collaboration, these types of efforts would simply not be possible.

______________________________________________________________

Disclosure

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Services

Genealogy Research

Legacy Tree Genealogists for genealogy research

Autosomal DNA Matching Confidence Spectrum

Posted on September 25, 2015 by Roberta Estes

Are you confused about DNA matches and what they mean…different kinds of matches…from different vendors and combined results between vendors. Do you feel like lions and tigers and bears…oh my? You’re not alone.

As the vendors add more tools, I’ve noticed recently that along with those tools has come a significant amount of confusion surrounding matches and what they mean. Add to this issue confusion about the terminology being used within the industry to describe various kinds of matches. Combined, we now have a verbiage or terminology issue and we have confusion regarding the actual matches and what they mean. So, as people talk, what they mean, what they are trying to communicate and what they do say can be interpreted quite widely. Is it any wonder so many people are confused?

I reached out within the community to others who I know are working with autosomal results on a daily basis and often engaged in pioneering research to see how they are categorizing these results and how they are referring to them.

I want to thank Jim Bartlett, Blaine Bettinger, Tim Janzen and David Pike (in surname alphabetical order) for their input and discussion about these topics. I hope that this article goes a long way towards sorting through the various kinds of matches and what they can and do mean to genetic genealogists – and what they are being called. To be clear, the article is mine and I have quoted them specifically when applicable.

But first, let’s talk about goals.

Goals

One thing that has become apparent over the past few months is that your goals may well affect how you interpret data. For example, if you are an adoptee, you’re going to be looking first at your closest matches and your largest segments. Distant matches and small segments are irrelevant at least until you work with the big pieces. The theory of low hanging fruit, of course.

If your goal is to verify and generally validate your existing genealogy, you may be perfectly happy with Ancestry’s Circles. Ancestry Circles aren’t proof, as many people think, but if you’re looking for low hanging fruit and “probably” versus “positively,” Ancestry Circles may be the answer for you.

If you didn’t stop reading after the last sentence, then I’m guessing that “probably” isn’t your style.

If your goal is to prove each ancestor and/or map their segments to your DNA, you’re not going to be at all happy with Ancestry’s lack of segment data – so your confidence and happiness level is going to be greatly different than someone who is just looking to find themselves in circles with other descendants of the same ancestor and go merrily on their way.

If you have already connected the dots on most of your ancestry for the past 4 or 5 generations, and you’re working primarily with colonial ancestors and those born before 1700, you may be profoundly interested in small segment data, while someone else decides to eliminate that same data on their spreadsheet to eliminate clutter. One person’s clutter is another’s goldmine.

While, technically, the different types of tests and matches carry a different technical confidence level, your personal confidence ranking will be influenced by your own goals and by some secondary factors like how many other people match on a particular segment.

Let’s start by talking about the different kinds of matching. I’ve been working with my Crumley line, so I’ll be utilizing examples from that project.

Individual Matching, Group Matching and Triangulation

There is a difference between individual matching, group matching and triangulation. In fact, there is a whole spectrum of matching to be considered.

Individual Matching

Individual matching is when someone matches you.

That’s great, but one match out of context generally isn’t worth much. There’s that word, generally, because if there is one thing that is almost always true, it’s that there is an exception to every rule and that exception often has to do with context. For example, if you’re looking for parents and siblings, then one match is all you need.

If this match happens to be to my first cousin, that alone confirms several things for me, assuming there is not a secondary relationship. First, it confirms my relationship with my parent and my parent’s descent from their parents, since I couldn’t be matching my first cousin (at first cousin level) if all of the lines between me and the cousin weren’t intact.

However, if the match is to someone I don’t know, and it’s not a close relative, like the 2^nd to 4^th cousins shown in the match above, then it’s meaningless without additional information. Most of your matches will be more distant. Let’s face it, you have a lot more distant cousins than close cousins. Many ancestors, especially before about 1900, were indeed, prolific, at least by today’s standards.

So, at this point, your match list looks like this:

Bridget looks pretty lonely. Let’s see what we can do about that.

Matching Additional People

The first question is “do you share a common ancestor with that individual?” If yes, then that is a really big hint – but it’s not proof of anything – unless they are a close relative match like we discussed above.

Why isn’t a single match enough for proof?

You could be related to this person through more than one ancestral line – and that happens far more than I initially thought. I did an analysis some time back and discovered that about 15% of the time, I can confirm a secondary genealogical line that is not related to the first line in my tree. There were another 7% that were probable – meaning that I can’t identify a second common ancestor with certainty, but the surname and location is the same and a connection is likely. Another 8% were from endogamous lines, like Acadians, so I’m sure there are multiple lines involved. And of those matches (minus the Acadians), about 10% look to have 3 genealogical lines, not just two. The message here – never assume.

When you find one match and identify one common genealogical line, you can’t assume that is how you are genetically related on the segment in question.

Ideally, at this point, you will find a third person who shares the common ancestor and their DNA matches, or triangulates, between you and your original match to prove the connection. But, circumstances are not always ideal.

What is Triangualtion?

Triangulation on the continuum of confidence is the highest confidence level achievable, outside of close relative matching which is evident by itself without triangulation.

Triangulation is when you match two people who share a common ancestor and all three of you match each other on that same segment. This means that segment descended to all three of you from that common ancestor.

This is what a match group would look like if Jerry matches both John and Bridget.

Example 1 – Match Group

The classic definition of triangulation is when three people, A, B and C all match each other on the same segment and share a known, identifiable common ancestor. Above, we only have two. We don’t know yet if John matches Bridget.

A matches B
A matches C
B matches C

This is what an exact triangulation group would look like between Jerry, John and Bridget. Most triangulation matches aren’t exact, meaning the start and/or end segment might be different, but some are exact.

Example 2 – Triangulation Group

It’s not always possible to prove all three. Sometimes you can see that Jerry matches Bridget and Jerry matches John, but you have no access to John or Bridget’s kits to verify that they also match each other. If you are at Family Tree DNA, you can run the ICW (in common with) tool to see if John and Bridget do match each other – but that tool does not confirm that they match on the same segment.

If the individuals involved have uploaded their kits to GedMatch, you have the ability to triangulate because you can see the kit numbers of your matches and you can then run them against each other to verify that they do indeed match each other as well. Not everyone uploads their kits to GedMatch, so you may wind up with a hybrid combination of triangulated groups (like example 2, above) and matching groups (like example 1, above) on your own personal spreadsheet.

Matching groups (that are not triangulated) are referred to by different names within the community. Tim Janzen refers to them as clusters of cousins, Blaine as pseudo triangulation and I have called them triangulation groups in the past if any three within the group are proven to be triangulated. Be careful when you’re discussing this, because matching groups are often misstated as triangulated groups. You’ll want to clarify.

Creating a Match List

Sometimes triangulation options aren’t available to us. For example, at Family Tree DNA, we can see who matches us, and we can see if they match each other utilizing the ICW tool, but we can’t see specifically where they match each other. This is considered a match group. This type of matching is also where a great deal of confusion is introduced because these people do match each other, but they are NOT (yet) triangulated.

What we know is that all of these people are on YOUR match list, but we don’t know that they are on each other’s match lists. They could be matching you on different sides of your DNA or, if smaller segments, they might be IBC (identical by chance.)

You can run the ICW (in common with) tool at Family Tree DNA for every match you have. The ICW tool is a good way to see who matches both people in question. Hopefully, some of your matches will have uploaded trees and you can peruse for common ancestors.

The ICW tool is the little crossed arrows and it shows you who you and that person also match in common.

You can run the ICW tool in conjunction with the ancestral surname in question, showing only individuals who you have matches in common with who have the Crumley surname (for example) in their ancestral surname list. This is a huge timesaver and narrows your scope of search immediately. By clicking on the ICW tool for Ms. Bridget, you see the list, below of those who match both the person whose account we are signed into and Ms. Bridget, below.

Another way to find common matches to any individual is to search by either the current surname or ancestral surnames. The ancestral surname search checks the surnames entered by other participants and shows them in the results box.

In the example above, all of these individuals have Crumley listed in their surnames. You can see that I’ve sorted by ancestral surname – as Crumley is in that search box.

Now, your match lists looks like this relative to the Crumley line. Some people included trees and you can find your common ancestor on their tree, or through communications with them directly. In other cases, no tree but the common surname appears in the surname match list. You may want to note those results on your match list as well.

Of course, the next step is to compare these individuals in a matrix to see who matches who and the chromosome browser to see where they match you, which we’ll discuss momentarily.

Group Matching

The next type of matching is when you have a group of people who match each other, but not necessarily on the same segment of DNA. These matching groups are very important, especially when you know there is a shared ancestor involved – but they don’t indicate that the people share the same segment, nor that all (or any) of their shared segments are from this particular ancestor. Triangulation is the only thing that accomplishes proof positive.

This ICW matrix shows some of the Crumley participants who have tested and who matches whom.

You can display this grid by matching total cM or by known relationship (assuming the individuals have entered this information) or by predicted relationship range. The total cMs shared is more important for me in evaluating how closely this person might be related to the other individual.

The Chromosome Browser

The chromosome browser at Family Tree DNA shows matches from the perspective of any one individual. This means that the background display of the 22 Chromosomes (plus X) is the person all of the matches are comparing against. If you’re signed in to your account, then you are the black background chromosomes, and everyone is being compared against your DNA. I’m only showing the first 6 chromosomes below.

You can see where up to 5 individuals match the person you’re comparing them to. In this case, it looks like they may share a common segment on chromosome 2 among several descendants. Of course, you’d need to check each of these individuals to insure that they match each other on this same segment to confirm that indeed, it did come from a common ancestor. That’s triangulation.

When you see a grouping of matches of individuals known to descend from a common ancestor on the same chromosome, it’s very likely that you have a match group (cluster of cousins, pseudo triangulation group) and they will all match each other on that same segment if you have the opportunity to triangulate them, but it’s not absolute.

For example, below we have a reconstructed chromosome 8 of James Crumley, the common ancestor of a large group of people shown based on matches. In other words, each colored segment represents a match between two people. I have a lot more confidence in the matches shown with the arrows than the single or less frequent matches.

This pseudo triangulation is really very important, because it’s not just a match, and it’s not triangulation. The more people you have that match you on this segment and that have the same ancestor, the more likely that this segment will triangulate. This is also where much of the confusion is coming from, because matching groups of multiple descendants on the same segments almost always do triangulate so they have been being called triangulation groups, even when they have not all been triangulated to each other. Very occasionally, you will find a group of several people with a common ancestor who triangulate to each other on this common segment, except one of a group doesn’t triangulate to one other, but otherwise, they all triangulate to others.

This situation has to be an error of some sort, because if all of these people match each other, including B, then B really must match D. Our group discussed this, and Jim Bartlett pointed out that these problem matches are often near the vendor matching threshold (or your threshold if you’re using GedMatch) and if the threshold is lowered a bit, they continue to match. They may also be a marginal match on the edge, so to speak or they may have a read error at a critical location in their kit.

What “in common with” matching does is to increase your confidence that these are indeed ancestral matches, a cousin cluster, but it’s not yet triangulation.

Ancestry Matches

Ancestry has added another level of matching into the mix. The difference is, of course, that you can’t see any segment data at all, at Ancestry, so you don’t have anything other than the fact that you do match the other person and if you have a shakey leaf hint, you also share a common ancestor in your trees.

When three people match each other on any segment (meaning this does not infer a common segment match) and also share a common ancestor in a tree, they qualify to be a DNA Circle. However, there is other criteria that is weighted and not every group of 3 individuals who match and share an ancestor becomes a DNA Circle. However, many do and many Circles have significantly more than three individuals.

This DNA Circle is for Phebe Crumley, one of my Crumley ancestors. In this grouping, I match one close family group of 5 people, and one individual, Alyssa, all of whom share Phebe Crumley in their trees. As luck would have it, the family group has also tested at Family Tree DNA and has downloaded their results to GedMatch, but as it stands here at Ancestry, with DNA Circle data only…the only thing I can do is to add them to my match list.

In case you’re wondering, the reason I only added three of the 5 family members of the Abija group to my match list is because two are children of one of the members and their Crumley DNA is represented through their parent.

While a small DNA Circle like Phebe Crumley’s can be incorrect, because the individuals can indeed be sharing the DNA of a different ancestor, a larger group gives you more confidence that the relationship to that group of people is actually through the common ancestor whose circle you are a member of. In the example Circle shown below, I match 6 individuals out of a total of 21 individuals who are all interrelated and share Henry Bolton in their tree.

New Ancestor Discoveries

Ancestry introduced New Ancestor Discoveries (NADs) a few months ago. This tool is, unfortunately, misnamed – and although this is a good concept for finding people whose DNA you share, but whose tree you don’t – it’s not mature yet.

The name causes people to misinterpret the “ancestors” given to them as genuinely theirs. So far, I’ve had a total of 11 NADS and most have been easily proven false.

Here’s how NADs work. Let’s say there is a DNA Circle, John Doe, of 3 people and you match two of them. The assumption is that John Doe is also your ancestor because you share the DNA of his descendants. This is a critically flawed assumption. For example, in one case, my ancestors sister’s husband is shown as my “new ancestor discovery” because I share DNA with his descendants (through his wife, my ancestor’s sister.) Like I said, not mature yet.

I have discussed this repeatedly, so let’s just suffice it to say for this discussion, that there is absolutely no confidence in NADs and they aren’t relevant.

Shared Matches

Ancestry recently added a Shared Matches function.

For each person that you match at Ancestry, that is a 4^th cousin or closer and who has a high confidence match ranking, you can click on shared matches to see who you and they both match in common.

This does NOT mean you match these people through the same ancestor. This does NOT mean you match them on the same segment. I wrote about how I’ve used this tool, but without additional data, like segment data, you can’t do much more with this.

What I have done is to build a grid similar to the Family Tree DNA matrix where I’ve attempted to see who matches whom and if there is someone(s) within that group that I can identify as specifically descending from the same ancestor. This is, unfortunately, extremely high maintenance for a very low return. I might add someone to my match list if they matched a group (or circle) or people that match me, whose common ancestor I can clearly identify.

Shared Matches are the lowest item on the confidence chart – which is not to say they are useless. They can provide hints that you can follow up on with more precise tools.

Let’s move to the highest confidence tool, triangulation groups.

Triangulation Groups

Of course, the next step, either at 23andMe, Family Tree DNA, through GedMatch, or some combination of each, is to compare the actual segments of the individuals involved. This means, especially at Ancestry where you have no tools, that you need to develop a successful begging technique to convince your matches to download their data to GedMatch or Family Tree DNA, or both. Most people don’t, but some will and that may be the someone you need.

You have three triangulation options:

If you are working with the Family Inheritance Advanced at 23andMe, you can compare each of your matches with each other. I would still invite my matches to download to GedMatch so you can compare them with people who did not test at 23andMe.
If you are working with a group of people at Family Tree DNA, you can ask them to run themselves against each other to see if they also match on the same segment that they both match you on. If you are a project administrator on a project where they are all members, you can do this cross-check matching yourself. You can also ask them to download their results to GedMatch.
If your matches will download their results to GedMatch, you can run each individual against any other individual to confirm their common segment matches with you and with each other.

In reality, you will likely wind up with a mixture of matches on your match list and not everyone will upload to GedMatch.

Confirming that segments create a three way match when you share a common ancestor constitutes proof that you share that common ancestor and that particular DNA has been passed down from that ancestor to you.

I’ve built this confidence table relative to matches first found at Family Tree DNA, adding matches from Ancestry and following them to GedMatch. Fortunately, the Abija group has tested at all 3 companies and also uploaded their results to GedMatch. Some of my favorite cousins!

Spectrum of Confidence

Blaine Bettinger built this slide that sums up the tools and where they fall on the confidence range alone, without considerations of your goals and technical factors such as segment size. Thanks Blaine for allowing me to share it here.

These tools and techniques fall onto a spectrum of confidence, which I’ve tried to put into perspective, below.

I really debated how to best show these. Unfortunately, there is almost always some level of judgment involved. In some cases, like triangulation at the 3 vendors, the highest level is equivalent, but in other cases, like the medium range, it really is a spectrum from lowest to highest within that grouping.

Now, let’s take a look at our matches that we’ve added to our match list in confidence order.

As you would expect, those who triangulated with each other using some chromosome browser and share a common ancestor are the highest confidence matches – those 5 with a red Y. These are followed by matches who match me and each other but not on the same segment (or at least we don’t know that), so they don’t triangulate, at least not yet.

I didn’t include any low confidence matches in this table, but of the lowest ones that are included, the shakey leaf matches at Ancestry that won’t answer inquiries and the matches at FTDNA who do share a common surname but didn’t download their information to be triangulated are the least confident of the group. However, even those lower confidence matches on this chart are medium, meaning at Ancestry they are in a Circle and at FTDNA, they do match and share a common surname. At Family Tree DNA, they may eventually fall into a triangulation group of other descendants who triangulate.

Caveats

As always, there are some gotchas. As someone said in something I read recently, “autosomal DNA is messy.”

Endogamy

Endogamous populations are just a mess. The problem is that literally, everyone is related to everyone, because the founder population DNA has just been passed around and around for generations with little or no new DNA being introduced.

Therefore, people who descend from endogamous populations often show to be much more closely related than they are in a genealogical timeframe.

Secondly, we have the issue pointed out by David Pike, and that is when you really don’t know where a particular segment came from, because the segment matches both the parents, or in some cases, multiple grandparents. So, which grandparent did that actual segment that descended to the grandchild descend from?

For people who are from the same core population on both parent’s side, close matches are often your only “sure thing” and beyond that, hopefully you have your parents (at least one parent) available to match against, because that’s the only way of even beginning to sort into family groups. This is known as phasing against your parents and while it’s a great tool for everyone to use – it’s essential to people who descend from endogamous groups. Endogamy makes genetic genealogy difficult.

In other cases, where you do have endogamy in your line, but only in one of your lines, endogamy can actually help you, because you will immediately know based on who those people match in addition to you (preferably on the same segment) which group they descend from. I can’t tell you how many rows I have on my spreadsheet that are labeled with the word “Acadian,” “Brethren” and “Mennonite.” I note the common ancestor we can find, but in reality, who knows which upstream ancestor in the endogamous population the DNA originated with.

Now, the bad news is that Ancestry runs a routine that removes DNA that they feel is too matchy in your results, and most of my Acadian matches disappeared when Ancestry implemented their form of population based phasing.

Identical by Population

There is sometimes a fine line between a match that’s from an ancestor one generation further back than you can go, and a match from generations ago via DNA found at a comparatively high percentage in a particular population. You can’t tell the difference. All you know is that you can’t assign that segment to an ancestor, and you may know it does phase against a parent, so it’s valid, meaning not IBC or identical by chance.

Yes, identical by population segment matching is a distinct problem with endogamy, but it can also be problematic with people from the same region of the world but not members of endogamous populations. Endogamy is a term for the timeframe we’re familiar with. We don’t know what happened before we know what happened.

From time to time, you’ll begin to see something “odd” happened where a group of segments that you already have triangulated to one ancestor will then begin to triangulate to a second ancestor. I’m not talking about the normal two groups for every address – one from your Mom’s side and one from your Dad’s. I’m talking, for example, when my Mom’s DNA in a particular area begins to triangulate to one ancestral group from Germany and one from France. These clearly aren’t the same ancestors, and we know that one particular “spot” or segment range that I received from her DNA can only come from one ancestor. But these segment matches look to be breaking that rule.

I created the example below to illustrate this phenomenon. Notice that the top and bottom 3 all match nicely to me and to each other and share a common ancestor, although not the same common ancestor for the two groups. However, the range significantly overlaps. And then there is the match to Mary Ann in the middle whose common ancestor to me is unknown.

Generally, we see these on smaller segment groups, and this is indicative that you may be seeing an identical by population group. Many people lump these IBP (identical by population) groups in with IBC, identical by chance, but they aren’t. The difference is that the DNA in an IBP group truly is coming from your ancestors – it’s just that two distinct groups of ancestors have the same DNA because at some point, they shared a common ancestor. This is the issue that “academic phasing” (as opposed to parental phasing) is trying to address. This is what Ancestry calls “pileup areas” and attempts to weed out of your results. It’s difficult to determine where the legitimate mathematical line is relative to genealogically useful matches versus ones that aren’t. And as far as I’m concerned, knowing that my match is “European” or “Native” or “African” even if I can’t go any further is still useful.

Think about this, if every European has between 1 and 4% Neanderthal DNA from just a few Neanderthal individuals that lived more than 20,000 years ago in Europe – why wouldn’t we occasionally trip over some common DNA from long ago that found its way into two different family lines.

When I find these multiple groupings, which is actually relatively rare, I note them and just keep on matching and triangulating, although I don’t use these segments to draw any conclusions until a much larger triangulated segment match with an identified ancestor comes into play. Confidence increases with larger segments.

This multiple grouping phenomenon is a hint of a story I don’t know – and may never know. Just because I don’t quite know how to interpret it today doesn’t mean it isn’t valid. In time, maybe its full story will be revealed.

ROH – Runs of Homozygosity

Autosomal DNA tests test someplace over 500,000 locations, depending on the vendor you select. At each of those locations, you find a value of either T, A, C or G, representing a specific nucleotide. Sometimes, you find runs of the same nucleotide, so you will find an entire group of all T, for example. If either of your parents have all Ts in the same location, then you will match anyone with any combination of T and anything else.

In the example above, you can see that you inherited T from both your Mom and Dad. Endogamy maybe?

Sally, although she will technically show as a match, doesn’t really “match” you. It’s just a fluke that her DNA matches your DNA by hopping back and forth between her Mom’s and Dad’s DNA. This is not a match my descent, but by chance, or IBC (identical by chance.) There is no way for you to know this, except by also comparing your results to Sally’s parents – another example of parental phasing. You won’t match Sally’s parents on this segment, so the segment is IBC.

Now let’s look at Joe. Joe matches you legitimately, but you can’t tell by just looking at this whether Joe matches you on your Mom’s or Dad’s side. Unfortunately, because no one’s DNA comes with a zipper or two sides of the street labeled Mom and Dad – the only way to determine how Joe matches you is to either phase against Joe’s parents or see who else Joe matches that you match, preferable on the same segment – in other words – create either a match or ICW group, or triangulation.

Segment Size

Everyone is in agreement about one thing. Large segments are never IBC, identical by chance. And I hate to use words like never, so today, interpret never to mean “not yet found.” I’ve seen that large segment number be defined both 13cM and 15cM and “almost never” over 10cM. There is currently discussion surrounding the X chromosome and false positives at about this threshold, but the jury is still out on this one.

Most medium segments hold true too. Medium segment matches to multiple people with the same ancestors almost always hold true. In fact, I don’t personally know of one that didn’t, but that isn’t to say it hasn’t happened.

By medium segments, most people say 7cM and above. Some say 5cM and above with multiple matching individuals.

As the segment size decreases, the confidence level decreases too, but can be increased by either multiple matches on that segment from a common proven ancestor or, of course, triangulation. Phasing against your parent also assures that the match is not IBD. As you can see, there are tools and techniques to increase your confidence when dealing with small segments, and to eliminate IBC segments.

The issue of small segments, how and when they can be utilized is still unresolved. Some people simply delete them. I feel that is throwing the baby away with the bathwater and small segments that triangulate from a common ancestor and that don’t find themselves in the middle of a pileup region that is identical by population or that is known to be overly matchy (near the center of chromosome 6, for example) can be utilized. In some cases, these segments are proven because that same small segment section is also proven against matches that are much larger in a few descendants.

Tim Janzen says that he is more inclined to look at the number of SNPs instead of the segment size, and his comfort number is 500 SNPs or above.

The flip side of this is, as David Pike mentioned, that the fewer locations you have in a row, the greater the chance that you can randomly match, or that you can have runs of heterozygosity.

No one in our discussion group felt that all small segments were useless, although the jury is still out in terms of consensus about what exactly defines a small segment and when they are legitimate and/or useful. Everyone of us wants to work towards answers, because for those of us who are dealing with colonial ancestors and have already picked the available low hanging fruit, those tantalizing small segments may be all that is left of the ancestor we so desperately need to identify.

For example, I put together this chart detailing my matching DNA by generation. Interesting, I did a similar chart originally almost exactly three years ago and although it has seemed slow day by day, I made a lot of progress when a couple of brick walls fell, in particular, my Dutch wall thanks to Yvette Hoitink.

If you look at the green group of numbers, that is the amount of shared DNA to be expected at each level. The number of shared cMs drops dramatically between the 5^th and 6^th generation from 13 cM which would be considered a reasonable matching level (according to the above discussion) at the 5^th generation, and 3.32 cM at the 6^th generation level, which is a small segment by anyone’s definition.

The 6^th generation was born roughly in 1760, and if you look to the white grouping to the right of the green group, you can see that my percentage of known ancestors is 84% in the 5^th generation, 80% in the 6^th generation, but drops quickly after that to 39, 22 and 3%, respectively. So, the exact place where I need the most help is also the exact place where the expected amount of DNA drops from 13 to 3.32 cM. This means, that if anyone ever wants to solve those genealogical puzzles in that timeframe utilizing genetic genealogy, we had better figure out how to utilize those small segments effectively – because it may well be all we have except for the occasional larger sticky segment that is passed intact from an ancestor many generations past.

From my perspective, it’s a crying shame that Ancestry gives us no segment data and it’s sad that 23andMe only gives us 5cM and above. It’s a blessing that we can select our own threshold at GedMatch. I’m extremely grateful that FTDNA shows us the small segment matches to 1cM and 500 SNPs if we also match on 20cM total and at least one segment over 7cM. That’s a good compromise, because small segments are more likely to be legitimate if we have a legitimate match on a larger segment and a known ancestor. We already discussed that the larger the matching segment, the more likely it is to be valid. I would like to see Family Tree DNA lower the matching threshold within projects. Surname projects imply that a group of people will be expected to match, so I’d really like to be able to see those lower threshold matches.

I’m hopeful that Family Tree DNA will continue to provide small segment information to us. People who don’t want to learn how to use or be bothered with small segments don’t have to. Delete is perfectly legitimate option, but without the data, those of us who are interested in researching how to best utilize these segments, can’t. And when we don’t have data to use, we all lose. So, thank you Family Tree DNA.

Coming Full Circle

This discussion brings us full circle once again to goals.

Goals change over time.

My initial reason for testing, the first day an autosomal test could be ordered, was to see if my half-brother was my half-brother. Obviously for that, I didn’t need matching to other people or triangulation. The answer was either yes or no, we do match at the half-sibling level, or we don’t.

He wasn’t. But by then, he was terminally ill, and I never told him. It certainly explained why I wasn’t a transplant match for him.

My next goal, almost immediately, was to determine which if either my brother or I were the child of my father. For that, we did need matching to other people, and preferably close cousins – the closer the better. Autosomal DNA testing was new at that time, and I had to recruit cousins. Bless those who took pity on me and tested, because I was truly desperate to know.

Suffice it to say that the wait was a roller coaster ride of emotion.

If I was not my father’s child, I had just done 30+ years of someone else’s genealogy – not a revelation I relished, at all.

I was my father’s child. My brother wasn’t. I was glad I never told him the first part, because I didn’t have to tell him this part either.

My goal at that point changed to more of a general interest nature as more cousins tested and we matched, verifying different lineages that has been unable to be verified by Y or mtDNA testing.

Then one day, something magical happened.

One of my Y lines, Marcus Younger, whose Y line is a result of a NPE, nonparental event, or said differently, an undocumented adoption, received amazing information. The paternal Younger family line we believed Marcus descended from, he didn’t. However, autosomal DNA confirmed that even though he is not the paternal child of that line, he is still autosomally related to that line, sharing a common ancestor – suggesting that he may have been born of a Younger female and given that surname, while carrying the Y DNA of his biological father, who remains unidentified.

Amazingly, the next day, a match popped up that matched me and another Younger relative. This match descended not from the Younger line, but from Marcus Younger’s wife’s alleged surname family. I suddenly realized that not only was autosomal DNA interesting for confirming your tree – it could also be used to break down long-standing brick walls. That’s where I’ve been focused ever since.

That’s a very different goal from where I began, and my current goal utilizes the tools in a very different way than my earlier goals. Confidence levels matter now, a great deal, where that first day, all I wanted was a yes or no.

Today, my goal, other than breaking down brick walls, is for genetic genealogy to become automated and much easier but without taking away our options or keeping us so “safe” that we have no tools (Ancestry).

The process that will allow us to refine genetic genealogy and group individuals and matches utilizing trees on our desktops will ultimately be the key to unraveling those distant connections. The data is there, we just have to learn how to use it most effectively, and the key, other than software, is collaboration with many cousins.

Aside from science and technology, the other wonderful aspect of autosomal DNA testing is that is has the potential to unite and often, reunite families who didn’t even know they were families. I’ve seen this over and over now and I still marvel at this miracle given to us by our ancestors – their DNA.

So, regardless of where you fall on the goals and matching confidence spectrum in terms of genetic genealogy, keep encouraging others to test and keep reaching out and sharing – because it takes a village to recreate an ancestor! No one can do it alone, and the more people who test and share, the better all of our chances become to achieve whatever genetic genealogy goals we have.

______________________________________________________________

Disclosure

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Services

Genealogy Research

Legacy Tree Genealogists for genealogy research

Demystifying Autosomal DNA Matching

Posted on January 17, 2015 by Roberta Estes

What, exactly, is an autosomal DNA match?

Answer: It’s Relative

I’m sorry, I just had to say that.

But truthfully, it is.

I know this sounds like a very basic question, and it is, but the answer sometimes isn’t as straightforward as we would like for it to be.

Plus, there are differences in quality of matches and types of matches. If you want to sigh right about now, it’s OK.

We’ve talked a lot about matching in various recent articles. I have several people who follow this blog religiously, and who would rather read this than, say, do dishes (who wouldn’t). One of our regulars recently asked me the question, “what, exactly, is a match and how do I tell?”

Darned good question and I wish someone had explained this to me so I wouldn’t have had to figure it out.

In the computer industry, where I spent many years, we have what we call flow charts or wernier diagrams which in essence are logic paths that lead to specific results or outcomes depending on the answers at different junctions.

I had a really hard time deciding whether to use the beer decision-making flow chart or the procrastinator flow chart, but the procrastinator flow chart was just one big endless loop, so I decided on the beer.

What I’m going to do is to step you through the logic path of finding and evaluating a match, determining whether it’s valid, identical by descent or chance, when possible, and how to work with your matches and what they mean.

Let me also say that while I use and prefer Family Tree DNA, these matching techniques are universal and apply to results from 23andMe as well, but not for Ancestry who gives you no browser or tools to compare your DNA to anyone else. So, you can’t compare your results at Ancestry.

Comparing DNA results is the lynchpin of genetic genealogy. You’re dead in the water without it. If you have tested at Ancestry, you can always transfer your results to Family Tree DNA, where you do have tools, and to GedMatch as well. You’re always better, in terms of genealogy, to fish in as many ponds as possible.

Before we talk about how to work with matches, for those who need to figure out how to find matches at Family Tree DNA and 23andMe, I wrote about that in the Chromosome Browser War article. This article focuses on working with matching DNA after you have found that you are a match to someone – and what those matches might mean.

Matching Thresholds

All autosomal DNA vendors have matching thresholds. People who meet or exceed those thresholds will be shown on your match list. People who do not meet the initial threshold will not be considered as a match to you, and therefore will not be on your match list.

Currently, at Family Tree DNA, their match threshold to be shown as a match is about 20cM of total matching DNA and a single segment of about 7.7cM with 500 SNPs or over. The words “about” are in there because there is some fuzziness in the rules based on certain situations.

After you meet that criteria and you are shown as a match to an individual, when you download your matching data, your matches to them on each chromosome will be shown to the 1cM and 500 SNP level

At 23andMe, the threshold is 7cMs/700 SNPs for the first segment. However, 23andMe has an upper limit of people who can match you at about 1000 matches. This can be increased by the number of people you are communicating or sharing with. However, your smallest matches will be dropped from your list when you hit your threshold. This means that it’s very likely that at least some of your matches are not showing if you have in excess of 1000 matches total. This means that your personal effective cM/SNP match threshold at 23andMe may be much higher.

Step 1 – Downloading Your Matching Segments

For this comparison, I’m starting with two fresh files from Family Tree DNA, one file of my own matches and one of my mother’s matches. My mother died before autosomal DNA testing was available, so her results are only at Family Tree DNA (and now downloaded to GedMatch,) because her DNA was archived there. Thank you Family Tree DNA, 100,000 times thank you!!!

At Family Tree DNA, the option to download all matches with segment information is on the chromosome browser tab, at the top, at the right, shown below.

If you have your parents DNA available to test and it hasn’t been tested, order a kit for them today. If either or both parents have been tested, download their results into the same spreadsheet with yours and color code them in a way you will understand.

In my case, I only have my mother’s results, and I color coded my matches pink, because I’m the daughter. However, if I had both parents, I might have colored coded Mother pink and Dad blue.

Whatever color coding you do, it’s forever in your master spreadsheet, so make a note of what it is. In my case, it’s part of the match column header. Why is it in my column header? Because I screwed up once and reversed them in a download.

Step 2 – Preparing and Sorting Your Spreadsheet

In my master DNA spreadsheet, I have the following columns,

The green cell matches are matches to me from 23andMe. My cousin, Cheryl also tested at 23andMe before autosomal testing was offered at Family Tree DNA.

The Source column, in my spreadsheet, means any source other than FTDNA. The Ignore column is an extraneous number generated at one time by downloads. I could delete that column now.

The “Side” column is which side the match is from, Mom or Dad. Mom’s I can identify easily, because I have her DNA to compare to. I don’t identify a match as Dad’s without having identified an ancestral line, because I don’t have his DNA to compare to.

And no, you can’t just assume that if it doesn’t match Mom, it’s an automatic match to Dad because you may have some IBS, identical by chance, matches.

The Common Ancestors/Comments column is just that. I include things like when I e-mailed someone, if the match is triangulated and if so, with whom, etc.

In my master spreadsheet, the first “name” column (of who tested) is deleted, but I’ve left it in the working spreadsheet (below) with my mother for illustration purposes. That way, neither of us has to remember who is pink!

Step 3 – Reviewing IBD and IBS Guidelines

If you need a refresher on, phasing, IBD, identical by descent, IBS which can mean either identical by chance or identical by population, it would be a good time to read or reread the article titled How Phasing Works and Determining IBD Versus IBS Matches.

Let’s briefly review the IBD vs IBS guidelines, because we’ll be applying them in this article.

Identical by Chance – Can be determined if an individual you match does not match to one of your parents, if parents are available. If parents are not available for matching, IBS by chance segments won’t triangulate with other known genealogical matches on a common segment.

Identical by Descent – Can be suggested if a common ancestor (or ancestral line) can be determined between any two people who are not known relatives. If the two people are known close relatives, and their DNA matches, identical by descent is proven. IBD can be proven with previously unknown family or genealogical matches when any three people descending from that same ancestor or ancestral line all match each other on the same segment of DNA. Three way matching is called triangulation.

Identical by Population – Can be determined when multiple people triangulate with you on a specific segment of DNA, but the triangulated groups are from proven different lineages and are not otherwise related. This is generally found in smaller segments from similar regions of the world. Identical by population is identical by descent, but the ancestors are so far back in time that they cannot be determined and may contribute the same DNA to multiple lineages. This is particularly evident in Jewish genealogy and other endogamous groups.

Step 4 – Determining Parental Side and IBS by Chance

The first thing to do, if you have either or both parents, is to determine whether your matches phase to your parents or are IBS by chance.

In this context, phasing means determining whether a particular match is to your father’s side of the family or to your mother’s side of the family.

Remember, at every address in your DNA, you will have two valid matches to different lines, one from your mother and one from your father. The address on your DNA consists of the chromosome number which equates to the street name, and then the start and end locations, which consists of a range of addresses on that street. Think of it as the length of your property on the street.

First, let’s look at my situation with only my mother’s DNA for comparison.

It’s easy to tell one of three things.

Do mother and I both match the person? If so, that means that DNA match is from mother’s side of the family. Mark it as such. They are green, below.
If the individual does not match me and mother, both, and only matches me, then the match is either on my father’s side or it’s IBS by chance. Those matches are blue below. Because I don’t have my father’s DNA, I can’t tell any more at this step.
Notice the matches that are Mom’s but not to me. That means that I did not receive that DNA from Mom, or I received a small part, but it’s not over the lowest matching threshold at Family Tree DNA of 1cM and 500 SNPs.

In this next scenario, you can see that mother and I both match the same individual, but not on all segments. I selected this particular match between me, my mother and Alfred because it has some “problems” to work through.

The segments shown in green above are segments that Mom carries that I don’t. This means that I didn’t receive them from mother. This also means they could be matching to Alfred legitimately, or are IBS by chance. I can’t tell anything more about them at this point, so I’ve just noted what they are. I usually mark these as “mother only” in my master spreadsheet.

The first of the two green rows above show a match but it’s a little unusual. My segment is larger than my mothers. This means that one of five things has happened.

Part of this segment is a valid match. At the end, where we don’t match, the match extends IBS by chance a bit at the end, in my case, when matching Alfred. The valid match portion would end where my mother’s segment ends, at 16,100,293
There is a read error in one of the files.
The boundary locations are fuzzy, meaning vendor calculations like ‘healing’ for no calls, etc..
I also match to my father’s line.
Recombination has occurred, especially possible in an endogamous population, reconnecting identical by population segments between me and Alfred at the end of the segment where I don’t match my mother’s segment, so from 16,100,293 to 16,250,884.

Given that this is a small segment, the most likely scenario would be the first, that this is partly valid and partly IBS by chance. I just make the note by that row.

The second green segment above isn’t an exact match, but if my segment “fits within” the boundaries of my mother’s segments, then we know I inherited the entire segment from her. Once again, my boundaries are off a bit from hers, but this time it’s the beginning. The same criteria applies as in 1-5, above.

The green segments above are where I match Alfred, but my mother does not. This means that these segments are either IBS by chance or that they will match my father. I don’t know which, so I simply label them. Given that they are all small segments, they are likely IBS by chance, but we don’t know that. If we had my father’s DNA, we would be able to phase against him, too, but we don’t.

Now, if I was to leave this discussion here, you might have the impression that all small segment matches have problems, but they don’t. In fact, here’s a much more normal “rea life” situation where mother and I are both matching to our cousin, Cheryl, Mom’s first cousin. These matches include both large and small segments. Let’s take a look and see what we can tell about our matches.

Roberta and Barbara have a total of 83 DNA matches to Cheryl.

Some matches will be where Barbara matches Cheryl and Roberta doesn’t. That’s normal, Barbara is Roberta’s mother and Roberta only inherits half of Barbara’s DNA. These rows where only Barbara, the mother, matches Cheryl are not colorized in the Start, End, cM and SNP columns, so they show as white.

Some matches will be exact matches. That too is normal. In some cases, Barbara passes all of a particular segment of DNA to Roberta. These matches are colored purple.

Some of these matches are partial matches where Roberta inherited part of the segment of DNA from Barbara. These are colored green. There are two additional columns at right where the percentage of DNA that Roberta inherited from Barbara on these segments is calculated, both for cM and SNPs.

Some of the matches are where Roberta matches Cheryl and Barbara doesn’t. Cheryl is not known to be related to Roberta on her father’s side, so assuming that statement is correct, these matches would be IBS, identical by state, meaning identical by chance and can be disregarded at legitimate matches. These are colored rust. Note that most of these are small segments, but one segment is 8.8cM and 2197 SNPs. In this case, if this segment becomes important for any reason, I would be inclined to look at the raw data file of Barbara to see if there were no calls or a problem with reads in this region that would prevent an otherwise legitimate match.

Let’s look at how these matches stack up.

	Number	Percent (rounded)	Comment
Exact Matches	26	31	100% of the DNA
Barbara Only	20	24	0% of the DNA
Partial Matches	29	35	11-98% of the actual DNA matches
Roberta Only (IBS by chance)	7	8	Not a valid match

I think it’s interesting to note that while, on the average, 50% of the DNA of any segment is passed to the child, in actuality, in this example of partial inheritance, meaning the green rows, inheritance was never actually 50%. In fact, the SNP and cM percentages inherited for the same segment varied, and the actual amounts ranged from 11-98% of the DNA of the parent being inherited by the child. The average of these events was 54.57143 (cM) and 54.21429 (SNPs) however.

On top of that, in 13 (26 rows) instances, Roberta inherited all of Barbara’s DNA in that sequence, and in 20 cases, Roberta inherited none of Barbara’s DNA in that sequence.

This illustrates that while the average of something may be 50%, none of the actual individual values may be 50% and the values themselves may include the entire range of possibilities. In this case, 11-98% were the actual percentage ranges for partial matches.

Matching Both Parents

I don’t have my father’s DNA, but I’m creating this next example as if I did.

Matches to mother are marked in green.

I have two matches where I match my father, so we can attribute those to his side, which I’ve done and marked in orange.

The third group of matches to me, at the bottom, to Julio, Anna, Cindy and George don’t match either parent, so they must be IBS by chance.

I label IBS by chance segments, but I don’t delete them because if I download again, I’ll have to go through this same analysis process if I don’t leave them in my spreadsheet

Step 5 – How Much of the DNA is a Match?

One person asked, “exactly how do I tell how much DNA is matching, especially between three people.” That’s a very valid question, especially since triangulation requires matching of three people, on the same segment, proven to a common ancestral line.

Let’s look at the match of both me and my mother to Don, Cheryl and Robin.

In this example, we know that Don, Cheryl and Robin all match me on my mother’s side, because they all three match me and my mother, both on the same segment.

How do we determine that we match on the same segment?

I have sorted this spreadsheet in order of end location, then start location, then chromosome number so that the entire spreadsheet is in chromosome order, then start location, then end location.

We can see that both mother and I match Cheryl partially on this segment of chromosome 1, but not exactly. The start location is slightly different, but the end location matches exactly.

The area where we all three match, meaning me, Mom and Cheryl, begins at 176,231,846 and ends at the common endpoint of 178,453,336

On the chart below, you can see that mother and I also both match Don, Cheryl’s brother, on part of this same segment, but not all of the same segment.

The common matching areas between me, Mom and Don begins at 176,231,846 and ends at 178,453,336.

Next, let’s look at the third person, Robin.

Mom and I both match Robin on part of this same overlapping segment as well. Note that my segment extends beyond Mom’s, but that does not invalidate the portion that does match between Robin, Mom and I.

Our common match area begins at the same location, but ends at 178,453,336, the same location as the common end area with Don and Cheryl

Step 6 – What Do Matches Mean? IBD vs IBS in Action

So, let’s look at various types of matches and what they tell us.

Looking at our matching situation above, let’s apply the various IBD/IBS rules and guidelines and see what we have

1. Are these matches identical by chance? No. How do we know?

a. Because they all match both me and a parent.

2. Are these matches identical by descent? Yes. How do we know?

a. Because we all match each other on this segment, and we know the common ancestor of Cheryl, Don, Barbara and me is Hiram Ferverda and Evaline Miller. We know that Robin descends from the same ancestral Miller line.

3. Are these matches identical by population. We don’t know, but there is no reason at this point to think so. Why?

a. Because looking at my master spreadsheet, I see no evidence that these segments are also assigned to other lineages. These individuals are also triangulated on a large number of other, much larger, segments as well.

4. Are these matches triangulated, meaning they are proven to a common ancestor? Yes. How do we know?

a. Documented genealogy of Hiram Ferverda and Evaline Miller. Don, Barbara, Cheryl and me are known family since birth.
b. Documented genealogy of Robin to the same ancestral family, even though Robin was previously unknown before DNA matching.
c. Even without the documented genealogy, Robin matches a set of two triangulation groups of people documented to the same ancestral line, which means she has to descend from that same line as well.

In our case, clearly these individuals share a common ancestor and a common ancestral line. Even though these are small segments on chromosome 1, there are much larger matching segments on other chromosomes, and the same rules still apply. The difference might be at some point smaller segments are more likely to be identical by population than larger segments. Larger segments, when available, are always safer to use to draw conclusions. Larger groups of matching individuals with known common genealogy on the same segments are also the safest way to draw conclusions.

Step 7 – Matching With No Parents

Sometimes you’re just not that lucky. Let’s say both of your parents have passed and you have no DNA from them.

That immediately eliminates phasing and the identical by chance test by comparing to your parents, so you’ll have to work with your matches, including your identical by chance segments.

A second way to “phase” part of your DNA to a side of your family is by matching with known cousins or any known family member.

In the situation above, matching to Cheryl, Don and Robin, let’s remove my mother and see what we have.

In this case, I still match to both of my first cousins, once removed, Cheryl and Don. Given that Cheryl and Don are both known cousins, since forever, I don’t feel the need for triangulation proof in this case – although the three of us are triangulated to our common ancestor. In other words, the fact that my mother does match them at the expected 1^st cousin level is proof enough in and of itself if we only had one cousin to test. We know our common ancestor is Cheryl and Don’s grandparents, who are my great-grandparents, Hiram Ferverda and Evaline Miller.

When I looked at Robin’s pedigree chart and saw that Robin descended from Philip Jacob Miller and wife Magdalena, I knew that this segment was a Miller side match, not a Ferverda match.

Therefore, matching with someone whose genealogy goes beyond the common ancestor of Cheryl, Don and me proves this line through 4 more generations. In other words, this DNA segment came through the following direct line to reach Me, Mother, Cheryl and Don.

Philip Jacob Miller and Magdalena
Daniel Miller
David Miller
John David Miller
Evaline Louise Miller who married Hiram Ferverda

Clearly, we know from the earlier chart that my mother carried this DNA too, but even if we didn’t know that, she obviously had to have carried this segment or I would not carry it today.

So, even though in this example, our parents aren’t directly available for IBS testing and elimination, we can determine that anyone who matches both me and Cheryl or me and Don will have also matched mother on that segment, so we have, in essence, phased those people by triangulation, not by direct parental matching.

Step 8 – Triangulation Groups

What else does this match group tell us?

It tells us that anyone else who matches me and any one of our triangulation group on that segment also descends from the Miller descendant clan, one way or another.

Why do they have to match me AND one of the triangulation group members on that segment? Because I have two sides to my DNA, my Mom’s side and my Dad’s side. Matching me plus another person from the triangulation group proves which side the match is on – Mom’s or Dad’s.

We were able to phase to eliminate any identical by chance segments people on Mom’s side, so we know matches to both of us are valid.

On Dad’s side, there are some IBS by chance people (or segments) thrown in for good measure because I don’t have my Dad’s DNA to eliminate them out of the starting gate. Those IBS segments will have to be removed in time by not triangulating with proven triangulated groups they should triangulate with, if they were valid matches.

When you map matches on your chromosome spreadsheet, this is what you’re doing. Over time, you will be able to tell when you receive a new match by who they match and where they fall on your spreadsheet which ancestral line they descend from.

GedMatch also includes a triangulation utility. It’s a great tool, because it produces trios of people for your top 400 matches. The results are two kits that triangulate to the third person whose kit number you are matching against.

The output, below, shows you the chromosome number followed by the two kit numbers (obscured) that triangulate at this location, and then the start and end location followed by the matching cMs. The result is triangulation groups that “slide to the right.”

In the example above, all of the triangulation matches to me above the red arrow include either Mother, my Ferverda cousins or the Miller group that we discussed in the Just One Cousin article. In other words they are all related via a common ancestor.

You can tell a great deal about triangulation groups by who is, and isn’t in them using deductive reasoning. And once you’ve figured out the key to the group, you have the key to the entire group.

In this case, Mom is a member of the first triangulation group, so I know this group is from her side and not Dad’s side. Both Ferverda cousins are there, so I know it’s Mom’s Dad’s side of the family. The Miller cousins are there, so I know it’s the Miller side of Mom’s Dad’s side of the family.

Please also note that while this entire group triangulates within itself, that the group manages to slide right and the first triangulated group of 3 in the list may not overlap the DNA of the last triangulated group of 3. In fact, because you can see the start and end points, you can tell that these two triangulated groups don’t overlap. The multiple triangulation groups all do match some portion of the group above and below them (in this case,) and as a composite group, they slide to the right. Because each group overlaps with the group above and below them, they all connect together in a genetic chain. Because there is an entire group that are triangulated together, in multiple ways, we know that it is one entire group.

This allows me to map that entire segment on my Mom’s side of my DNA, from 10,369,154 to 41,685,667 to this group because it is contiguously connected to me, triangulated and unbroken. The most distant ancestor listed will vary based upon the known genealogy of the three people being triangulated For example, part of this segment, may come from Philip Jacob Miller himself, the line’s founder,, but another part could come from his son’s wife, who is also my ancestor. Therefore, the various pieces of this group segment may eventually be attributed to different ancestors from this particular line based upon the oldest common ancestor of the three people who have triangulated.

In our example above, the second group starts where the red arrow is pointing. I have absolutely no idea which ancestor this second group comes from – except – I know it does not come from my mother’s side because her kit number isn’t there.

Neither are any of my direct line Estes or Vannoy relatives, so it’s probably not through that line either. My Bolton cousins are also missing, so we’ve probably eliminated several possible lines, 3 of 4 great grandparents, based on who is NOT in the match group. See the value of testing both close and distant cousins? In this case, the family members not only have to test, they also have to upload their results to GedMatch.

Conversely, we could quickly identify at least a base group by the presence in the triangulation groups of at least one my known cousins or people with whom I’ve identified my common ancestor. Two from the same line would be even better!!!

Endogamy

The last thing I want to show you is an example of what an endogamous group looks like when triangulated.

This segment of chromosome 9 is an Acadian matching group to my Mom – and the list doesn’t stop here – this is just the size of the screen shot. These matches continue for pages.

How do I know this group is Acadian? In part, because this group also triangulates with my known Lore cousin who also descends from the same Acadian ancestor, Antoine Lore, son of Honore Lore and Marie Lafaille. Additionally, I’ve worked with some of these people and we have confirmed Honore Lore and Marie Lafaille as our common ancestor as well. In other cases, we’ve confirmed upstream ancestors.

Unfortunately, the Acadians are so intermarried that it’s very difficult to sort through the most distant genetic ancestor because there tend to be multiple most distant ancestors in everyone’s trees. There is a saying that if you’re related to one Acadian, you’re related to all Acadians and it’s the truth. Just ask my cousin Paul who I’m related to 137 different ways.

Matches to endogamous groups tend to have very, very long lists of matches, even triangulated, which means proven, matches.

Oh, and by the way, just for the record, this lengthy group includes some of my proven Acadian matches that were trimmed, meaning removed, from my match list when Ancestry did their big purge due to their new and improved phasing. So if there was ever any doubt that we did in fact lose at least some valid matches, the proof lies right here, in the triangulation of those exact same people at GedMatch

Summary

I hope this step by step article has helped take the Greek, or maybe the geek, out of matching. Once you think of it in a step by step logical basis, it makes a lot of sense and allows you to reasonably judge the quality of your matches.

The rule of thumb has been that larger matches tend to be “legitimate” and smaller matches are often discarded en masse because they might be problematic. However, we’ve seen situations where some larger matches may not be legitimate and some smaller matches clearly are. In essence, the 50% average seldom applies exactly and rules of thumb don’t apply in individuals situations either. Your situation is unique with every match and now you have tools and guidelines to help you through the matching maze.

And hey, since we made it to the end, I think we should celebrate with that beer!!!

______________________________________________________________

Disclosure

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Services

Genealogy Research

Legacy Tree Genealogists for genealogy research

Just One Cousin

Posted on January 11, 2015 by Roberta Estes

Recently, someone wrote to me and said that they thought the autosomal DNA matching between groups of family members was wonderful, but they have “just one first cousin” and feel left out. So, I decided to see what could be done with just two cousins. In this case, the two cousins are full siblings and both first cousins to my mother, Barbara. This would be the same process whether there was one or two cousins, since the two are siblings. Utilizing two cousins who are siblings just gives me the advantage of additional matching and triangulation capabilities.

This does presume that both people involved are willing to share and do a bit of comparison work on their various DNA accounts. In other words, you can’t do this by yourself without cooperation from your cousin.

Here’s the common ancestor of our testers.

Barbara, Cheryl and Don took a Family Finder autosomal DNA test at Family Tree DNA.

The DNA shared by Barbara, Cheryl and Don is from their common ancestral couple, Hiram B. Ferverda and Evaline Louise Miller.

Some of that shared DNA will be Hiram’s Ferverda DNA and some will be Evaline’s Miller DNA. The only way to differentiate between the Ferverda and Miller DNA is to test people who are only Ferverda or only Miller, descendants of people upstream of Hiram and Evaline, and if there are any common segments between the testers and those Ferverda or Miller individuals, you can then assign that DNA segment to that side of the family – Miller or Ferverda.

I’m using Barbara’s chromosome as the “match to” background, below. Cheryl, in orange, and Don, in blue, are shown as matches to Barbara. You can see that these three people share a lot of their grandparents DNA. You can also see where Don and Cheryl didn’t inherit the same DNA from their father, in some instances, like on chromosome 1 below, where Cheryl (orange) matches Barbara on a much larger part of the chromosome than Don does (blue.) But then look at chromosome 13 where Barbara and Don match on a huge segment and Cheryl, just a small portion. Don and Cheryl inherited different DNA from their parents at these locations.

The three testers’ common DNA segments on chromosome 1 are shown in the table below. I’ve colored Cheryl’s pink and her brother, Don’s, blue. You can see that Barbara matches some segments with Don that Cheryl didn’t inherit from her parents. All of the DNA Barbara matches with Cheryl on this chromosome is also matched, at least in part, in that location, with Don. The chart below, matches the graphic above, for chromosome 1 and is the “view data in a table” option on the chromosome browser as well as the leftmost “download to excel” option. The download to excel option at right downloads all of the matches for the individual, not just the ones currently showing in the chromosome browser.

When at least two known relatives have tested, we have something to compare against. In this case, we have a total of 3 people, 2 siblings and a first cousin, before we start matching outside known family. We don’t know which of their shared DNA comes from which ancestor, but we can now look for people who match Barbara and at least Cheryl OR Don which proves a common ancestor between the three individuals. Matching Barbara, Cheryl AND Don would be even better.

The gold standard for DNA matching, called triangulation, that proves a particular segment to a specific ancestor is as follows.

All (at least 2) people match you on the same segment.
Those people also match each other on the same segment.
Meaning, at least three people with a known common ancestral line must match on the same segment.

The key word here is “on the same segment”.

The next thing to do is to find out which of Barbara’s, Cheryl’s and Don’s matches are “in common with” each other. This means Barbara, Cheryl and Don all share a matching segment with these other people, but without additional analysis, we can’t determine whether they share a match on the same segment or not.

I ran Barbara “in common with” Cheryl and you can see that the first two people returned on that match list were me and Don because matches are listed in the order of the largest cM of shared data first. The “in common with” tool is the blue crossed arrows, below.

Next I ran Barbara in common with Don.

There were a total of 43 people in common with Cheryl and 49 with Don.

I downloaded the matching individuals (download link at the bottom right of the match page) and sorted them in a spreadsheet to see who matches whom. Here’s what the first part of my spreadsheet looks like (sorted in chromosome and segment order.) I colorized the rows by cousin for easier visualization.

We have 92 total matching individuals in common with Barbara and Cheryl and then Barbara and Don. A total of 19 people are listed as matching BOTH Cheryl and Don (for a total of 38 rows in the spreadsheet), so that means that there are 54 people who are in common with either Barbara and Cheryl or Barbara and Don, but not in common with all 3, Barbara AND Cheryl AND Don. This illustrates how differently siblings inherit DNA from their parents and how it affects matches another generation later.

In Common With Matches	To both Don and Cheryl	To Cheryl only or Don only, but not both
Barbara	19 (38 rows of 92)	54

Clearly, the people who match all three individuals, Barbara, Cheryl and Don are likely the closest relatives.

So let’s focus on those closest matching people. If you were utilizing only one cousin here, you would simply utilize every “in common with” match between two individuals and move forward. Because I have siblings here, and because I don’t want to deal with 72 different people, I’m using the fact that they are siblings to focus my efforts on the most closely related matches – people who match Barbara AND both siblings. You could also limit your focus by something like a common ancestral surname between all match members.

The next step is for each tester, meaning Barbara, Cheryl and Don, to compare each individual on the common match list to their DNA. This means that Barbara, Cheryl and Don all three will compare to all 18 individuals. We now have only 18 matching people, instead of 19, because I removed my own matches, since mine are a subset of Barbara’s. Checking to see how each of our testers matches each common matching person is the only way to determine that there is a three (or 4) way triangulation that will confirm a common ancestor.

There are two ways to do this at Family Tree DNA.

1. You can, 5 matches at a time, compare in the chromosome browser, then download only the matching segments to a spreadsheet for those 5 individuals. This means 4 sets of matches for each of three people.

Two cousins browser download

2. You can download Barbara, Cheryl and Don’s entire segment match list and then eliminate the matches that aren’t relevant to the discussion – meaning everyone except the 18 common matches between the three people.

The download option for the entire segment match list for the person whose kit you are looking at is shown at the top of the chromosome browser, to the right. Downloading the currently showing individuals matching segments is shown at the top of the chromosome browser, to the left.

Because we can only push 5 people at a time to the chromosome browser, in this case, it will be easier to simply download all of the matches for each of the three individuals and then put them into a common spreadsheet and sort by the names we determined match in common between all three cousins.

I downloaded all of the matches for Barbara, Cheryl and Don, colorized them and then sorted them in the spreadsheet by the name of who they matched. I then searched for the names of the 18 individuals who matched Barbara, Cheryl and Don, and copy/pasted them into a separate spreadsheet.

I could then sort the 18 matching individuals results by chromosome and start and end location.

Barbara’s DNA matches are white rows, Cheryl’s are pink and Don’s are blue.

The segments where Barbara, Don and Cheryl all match more than one other person on an overlapping area of their DNA segments are colorized green. This means that 4 or more people match on that same identical segment, the three known cousins and at least one other person.

The segments where at least Barbara and either Don or Cheryl (but not both) match at least one other person are colorized yellow. This means that least three people match on that same segment.

Since the gold standard of triangulation is 3 individuals matching on the same segment, both the yellow and green segments contain matches that fall into this category and are triangulated. All of those segments match at least two of the cousins, who match each other, plus in some cases, additional people too.

Let’s walk through one triangulation sequence.

In the green cluster, above, you can see that Barbara, Cheryl and Don all match Arthur on overlapping portions of the same segment. The overlapping portion between all 3 individuals and Arthur runs from 49,854,186 to 53,551,492. In addition, both Don and Cheryl match Tiffany on part of that same segment and Barbara matches Dean on part as well. These segments aren’t exactly the same for any of the cousins, with different amounts of matching DNA as reflected in the different cM and SNP values.

So, who is triangulated based on just this one green cluster? Barbara, Cheryl, Don and Arthur are triangulated to a common ancestor. We know that common ancestor is either the common ancestor of Cheryl, Don and Barbara – Hiram Ferverda and Evaline Miller – or upstream of that couple.

Tiffany is triangulated to both Cheryl and Don, but since Cheryl and Don are siblings, that’s irrelevant at this point – meaning we can’t tell if that match is IBS by chance or real because there is no additional match – at least not in this cluster.

In total, there are 19 green clusters (triangulated to at least 4 people) and 12 yellow clusters (triangulated to at least 3 people.)

In other words, the DNA that came from Hiram Ferverda and Evaline Miller is present in these matching people as well. The million dollar question, is, of course, which upstream ancestor did it come from? We genealogists are never satisfied, are we? Every answer just leads to more questions.

Before we begin looking at the DNA results and discussing what they mean, I want to share with you the family tree of Hiram Ferverda and Evaline Miller, because the DNA of the people who match Don, Cheryl and Barbara had to come from these people as well. This chart shows 7 generations back from Barbara, Cheryl and Don. The common ancestors of the people with whom they triangulate are likely to be within this timeframe.

The colorized ancestors above are the ancestors who contributed the X chromosome to both John Ferverda, Barbara’s father and Roscoe Ferverda, Cheryl and Don’s father.

In my working example, below, I’m utilizing the matches on chromosome 14 because chromosome 14 includes examples of a couple of interesting features.

Let’s look at the first green grouping. All three cousins match to SB and then Barbara matches also to Constance and William, our Lentz cousin on part of that overlapping segment as well. This suggests that this grouping might come from the Lentz side of the Miller tree, although we’ll see something else in a minute that might give us pause to reflect. So just hold that thought. Regardless, it does tell us that these individuals do share a common ancestor and it’s on the Miller side, not the Ferverda side.

The second green grouping is larger and includes larger segments as well, which are more reliably used, although the smaller green cluster clearly meets and exceeds the triangulation requirement of 3 matching individuals on the same segment.

This larger green cluster is actually quite interesting, because there are a total of 4 individuals, Ellen, Arthur, Eric and Tiffany who are all triangulated on this same segment with Don, Cheryl and Barbara. So, not only are they triangulated to Don, Cheryl and Barbara, but also to each other. These 7 people all share a common ancestor.

The yellow grouping shows an area where Eric matches Barbara and Don plus Arthur as well, but not Cheryl. We don’t know anything about Arthur or Eric’s genealogy, so we don’t know if this is Miller or Ferverda DNA, at least not yet. We’ll learn more about Arthur and Eric in a minute, even without their genealogy!

There are a couple of other areas on other chromosomes that are of interest too.

On this cluster on chromosome 12, we find a known Miller cousin, Rex, 2nd cousin to Barbara, Cheryl and Don. Because Rex also descends from the parents of Evaline Miller, we know that this segment shared with Rex has to be Miller DNA, not Ferverda DNA.

On this segment of chromosome 3, below, we see that Barbara, Cheryl and Don match Herbert, another known Miller cousin, plus Dee and Constance in much smaller amounts on the same segment. This tells us that this segment is descended from our common ancestor with Herbert.

Barbara, Don and Cheryl’s common ancestor with Herbert is Daniel Miller and Elizabeth Ulrich (Ullery), which makes them third cousins once removed – except – Herbert got a second dose of Miller DNA because Daniel Miller’s son, Isaac, married his first cousin who was also a Miller and shared grandparents with him. So Herbert, genetically, is closer than he would appear since he received the double dose of Miller DNA three generations upstream.

Gotta love these close knit families. The Millers were Brethren. These double doses of family DNA often carry forward by matching downstream when they might otherwise not be expected do so. That’s the upside of these endogamous groups. Now, here’s the downside.

See the segments with the words problem written to the right? Do you recognize what the problem is? You’ll notice that in the matching group we have BOTH cousin Herbert who is a Miller (and not a Lentz) and cousin William who is a Lentz (and not a Miller.)

This is a very common situation in endogamous communities.

To make matters worse, we are dealing with very small segments here, where we often see confusion. However, let’s look at the possibilities.

We do have triangulation, so one of three things has happened here.

First, the Brethren are an endogamous population that intermarried nearly exclusively within their faith. The Lentz and Miller families were both Brethren.

Here are our possibilities.

Our Lentz cousin has some Miller in one of his lines. This is entirely possible since he has a “short” pedigree chart and his families are living in the same Brethren communities as the other Lentz and Miller families.
Our Miller cousin has some Lentz in one of his lines. That is less likely, because his genealogy is pretty well fleshed out, although certainly possible because, once again, the families were living within close proximity and attending the same churches, etc.
This segment is truly a population based segment and will be found in people descending from that same base population. If this is the case, we still received it from one of our ancestors who came from that population, but since the Lentz and Miller lines may have both carried this same segment, we can’t tell who it came from. In other words, their common ancestor is further back in time than the Lentz and Miller families found in the US.

This segment cannot be IBS by chance because it does triangulate with the three cousins, Barbara, Don and Cheryl. The definition of IBS by chance shows us that chance segments would not phase (or match with) with a parent. If Don, Cheryl and Barbara all three carry this matching segment, it’s because their fathers both received it from their grandparents who were the common ancestor of Don, Cheryl and Barbara.

Neither Cheryl, Don nor Barbara can phase directly to their parents, who are deceased, so in this case, matching against first cousins is the best substitute we have. We know that common DNA between the first cousins had to come from their father’s, who were brothers. This in essence virtually phases Barbara, Don and Cheryl to their father’s on these matching segments. Not ideal, by any means, but even partial parental phasing is better than no phasing at all.

A third match, Dean, shows Miller in his family tree, but I could not connect his Miller line to the Johann Michael Miller ancestral line, from which our Miller line descends – so Dean is not a known cousin. Sometimes a common surname, even if found in the same geographic location, is not proof that the DNA connection is through that line. It’s easy to make that assumption, but it’s an assumption that is just waiting to bite you. Don’t do it!

Because of our known, proven DNA and genealogy matches to Herbert, we can attribute all of the segments where Herbert triangulates with either Barbara and Cheryl or Barbara and Don as Miller for all people involved. This means that this common DNA descends either from Daniel Miller and Elizabeth Ulrich or Daniel’s father Philip Jacob Miller and Magdalene, surname unknown.

Why have I listed two couples? Because, remember, Herbert has a double dose of Miller DNA from cousins and we don’t know which segment Barbara inherited, one from Daniel/Elizabeth or one from Philip Jacob/Magdalene (or some of each.) If the segment is from Daniel/Elizabeth, it could have come from either the Ulrich or Miller side. If it came from Daniel, then it also came from his father and mother, Philip Jacob/Magdalena and could either be Miller or Magdalena’s unknown line.

Because of our known, proven DNA and genealogy matches to Rex, we can attribute all of the segments where Rex triangulates with either Barbara and Cheryl or Barbara and Don as Miller for all people involved. Their common ancestor is John David Miller and Margaret Lentz, so their shared DNA could be either Lentz or Miller and is likely some of each.

For segments where there is no triangulation, but Barbara matches either Herbert or Rex, I still note that segment as Miller on my spreadsheet, since they are proven cousins, but I just omit the triangulation note.

For Barbara, that’s a total of 51 segments of her DNA that we can now assign to a Miller ancestral couple.

Furthermore, every segment that Barbara matches with either Cheryl or Don is now confirmed to be from her father’s side of the family, not her mother’s. While we don’t have Barbara’s parents available for testing, this is a pseudo way to phase your results to determine matches from one parents’ side of the family. For Barbara, that’s a total of 91 segments, some of them quite large. For example, roughly half of chromosome 13 matched with Don.

Just as a matter of interest, within those 91 segments that Barbara matches with either Don or Cheryl, a total of only 7 segments matched exactly between all 3 individuals in terms of start and end location, cMs and SNPs. While you might expect a number of small segments to match exactly, these weren’t all small. In fact, most weren’t small and some were quite large.

Exactly matching DNA segments between Barbara and Cheryl and Barbara and Don.

Chromosome	Matching cM	Matching SNPs
1	8.65	1189
1	7.01	1150
8	27.79	7279
10	20.78	5141
12	27.68	6046
14	2.11	700
14	49.47	9032

This means that these segments were not divided at all in a total of 5 DNA transmission events.

Hiram to John
Hiram to Roscoe
John to Barbara
Roscoe to Cheryl
Roscoe to Don

Additionally, I carry two of these exact segments as well, so those two segments survived 6 transmission events.

Clearly these segments are what we would term “sticky” because they certainly are not following the statistical average of dividing the DNA in half (by 50%) in each transmission event.

There is one more thing we can tell from matching.

Both Barbara and Cheryl match with SB on the X chromosome on the same segments.

This is particularly interesting because of the special inheritance path of the X chromosome. We know that SB must be related on Evaline Miller’s side of the family, because John and Roscoe Ferverda did not receive an X chromosome from their father. So Barbara, Cheryl and Don have to have received it from Evaline. Unfortunately, SB listed no genealogy on Family Tree DNA, but based on the X chromosome inheritance path, I can tell you that SB is either descended from John David Miller and Margaret Lentz, or from the Schaeffer, Lentz or Moselman lines colored pink or blue, below.

At this point, I made a chart of how the matches grouped with each other on each of the green clusters.

I intended to create a nice chart in Excel or Word, but with all of the various colors of ink involved, I didn’t think I could find enough color differentiation so we’ll just have to suffer with my hand-made chart. There are subtle color differences here – a different color or marker type for each of the 19 green clusters.

What I did was to look at each of the green DNA spreadsheet groupings and create a colorized chart, by group, for each grouping. So everyone in the first cluster had their X in the boxes of who they matches in the same color, say blue pen. The second group, orange marker, and so forth. That way I can see who was orange or yellow or blue and if those groups tend to cluster together.

Remember Arthur and Eric from above, whose genealogy we knew nothing about. You can see, for example, that Arthur matches in various groups with lots of people, and most often, Tiffany. Arthur and Eric also match in multiple groups that include each other and Rex, a known Miller descendant, so we can attribute both Arthur and Eric’s DNA matches to the Miller side of the tree. Keep in mind, all of these people also match with Barbara, Cheryl and Don.

Tiffany clusters with Arthur and Sarah and Eric in multiple groups and with Constance, David, Ellen, Leland and Rex in at least one other cluster. So another Miller side person.

On chromosome 14, Eric, Ellen, Arthur and Tiffany were all triangulated on the same segment with Don, Cheryl and Barbara, so we know those 7 individuals unquestionably share a common ancestor.

Let’s look at SB again, our X match. Since SB’s X connection can’t come from the Miller side, given the X inheritance path, and SB also matches with our Lentz cousin, it’s likely that SB is related through the Lentz lines.

Normally, when doing this matching relationship chart, you tend to see two distinct groupings, a mother’s side and a father’s side. In other words, there will be some groups that absolutely don’t overlap with the others. That’s not the case here.

So, by now you might be wondering what happened to the Ferverda side of the family? I was secretly hoping to find a closet Ferverda relative in this exercise, and I thought we might have, actually. Notice that Harold has no clustering at all, but he clearly matches Barbara, Cheryl and Don – but doesn’t cluster with any other Miller or Lentz cousins. Therefore, he could be from the Ferverda side of the family, but since he provided no genealogy information or surnames at Family Tree DNA, I can’t easily tell.

However, I am not entirely without recourse. I checked Harold “in common with” Barbara and discovered that he matches both Rex, our Miller cousin and William, our Lentz cousin, so even though Harold did not triangulate with William and/or Rex on any segments with both Barbara and/or Cheryl/Don, those Miller/Lentz matches certainly suggest descent from this line. I’ll be sending him an e-mail!

So, there are no Ferverda cousins represented in these matches.

I decided to check one more thing, now that I know that all of these matches are on the Miller side and that we have 3 known, proven genealogical cousins, Rex, Herbert and William. I wanted to see how many of our individuals who match Barbara, Cheryl and Don also match one of the known cousins. I selected Barbara as the base match kit to use, since we know they all matched Barbara, Cheryl and Don, and then I ran “in common with” for each one of them with Barbara, with the following results. A few did match one of the Miller or Lentz cousins, but fewer than I expected.

*However, we had a surprise. Dean matched another Miller male individual whose line is proven to descend through two children of Philip Jacob Miller and Magdalena, surname unknown. Another first cousin marriage. Another cousin discovered!

Furthermore, I noticed yet another individual, Doug, in Barbara’s match list and in common with 6 of the matches as well. Looking at Doug’s pedigree chart, not only is he a Miller descendant, he also descends from two of the Miller wives lines too. Another cousin confirmed!

But why no Ferverda matches?

Recent immigrants.

The Ferverda side of the family immediately jumps the pond to Holland, with Hiram himself being an immigrant as a young teen in the 1860s. There are few Ferverda (Fervida, Ferwerda) descendants here in the US to test, and many are Brethren or Mennonite. Few people in the Netherlands have participated in DNA testing.

The converse of that, Evaline Miller’s lines have all been in the US since the early/mid-1700s, so there are lots of descendants. Oh, the difference about a hundred years and 5 or 6 generations makes in the number of descendants who might be available to test. This situation, unfortunately, created a very lopsided chart without the division I’m used to seeing. On the other hand, thank goodness Evaline’s line and Hiram’s line are very distinct!

At this point, if you’re doing this “one cousin” exercise, you’ll need to do a few things.

1. Check each of the matching individuals to see if they have uploaded or created a pedigree chart at Family Tree DNA. If they do, their pedigree icon will be green, shown below. If so, click on the icon and search for every surname (and variant) associated with your known common lines with your cousin.

2. Check to see if these people entered a list of surnames, even if they don’t have a pedigree chart. The surnames are listed in the furthest right column. If you have entered your surnames, any that match yours will be bolded. Beware of variant spellings.

You can see above that I am the only one of the matches shown with a pedigree chart icon, shown in green, and the common surnames are bolded at right.

3. If your matches don’t have a pedigree chart, write to them and tell them you have a common ancestor and give them a list of your ancestors in your direct line. Please, PLEASE include the name on the kit that you match. Many people manage multiple kits and will ignore requests with only partial information.

4. If you have additional cousins to test, do so. I’m sure you can see how valuable additional cousins DNA would be.

5. Be sure to check your matches by “ancestral surname” to be sure that you haven’t missed any cousins who have already tested. The ancestral surname search box can be seen above the “known relationship” heading in the graphic above.

6. If you haven’t done so, enter your surnames under the “Manage Personal Information” tab under “My Account” at Family Tree DNA. Then click on the genealogy tab, then Surnames.

7. From your main personal page, of course, you can upload your Gedcom file by clicking on “My Family Tree.”

8. Run “in common with” for each of the common matches of your two cousins and look for common matching names between them. Those matching “in common with” names serve as a hint as to shared ancestry. Your answer may be hiding in your cousins’ trees!

Utilize all of these tools to help your search.

Summary

Not bad for thinking we couldn’t do anything with our DNA matches because we had “just one cousin” to work with, even though I cheated and used siblings.

What, exactly, did we manage to do?

I attributed 91 segments of Barbara’s DNA to her father’s side of the tree.
I filled in 51 segments of Barbara’s DNA to ancestral couples.
I found 5 confirmed genealogy/DNA cousins.
I found 16 people whose genealogy is unknown, but who triangulate with Barbara, Cheryl and Don. We know for sure which side of the tree these people match on – all Millers.
I can tell the X match which lines they descend from, even if they don’t know.
I can do one more very cool thing. Utilizing the Lazarus utility at GedMatch, I can now recreate at least a partial autosomal DNA file for both John and Roscoe Ferverda, the fathers of our testers. Join me in a couple days and we’ll see how that works!

This same process works between any two people who know how they are related and their common ancestor. It’s a great way to find cousins you didn’t know you had, or you didn’t know have DNA tested, and how they are related to you and each other.

Some people get very discouraged when even thinking about working with endogamous populations, or cousin marriages. One of the reasons I used this particular example is that I wanted to illustrate that while these situations are challenging from time to time, they are far from hopeless – so don’t let that deter you.

In fact, of the 5 confirmed cousins discovered during this process, some in unexpected ways, at least 3 and possibly 4 are through multiple lines. Some of these matches are probably thanks to endogamy.

Happy hunting!

______________________________________________________________

Disclosure

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Services

Genealogy Research

Legacy Tree Genealogists for genealogy research

Chromosome Browser War

Posted on November 30, 2014 by Roberta Estes

There has been a lot of discussion lately, and I mean REALLY a lot, about chromosome browsers, the need or lack thereof, why, and what the information really means.

For the old timers in the field, we know the story, the reasons, and the backstory, but a lot of people don’t. Not only are they only getting pieces of the puzzle, they’re confused about why there even is a puzzle. I’ve been receiving very basic questions about this topic, so I thought I’d write an article about chromosome browsers, what they do for us, why we need them, how we use them and the three vendors, 23andMe, Ancestry and Family Tree DNA, who offer autosomal DNA products that provide a participant matching data base.

The Autosomal Goal

Autosomal DNA, which tests the part of your DNA that recombines between parents every generation, is utilized in genetic genealogy to do a couple of things.

To confirm your connection to a specific ancestor through matches to other descendants.
To break down genealogy brick walls.
Determine ethnicity percentages which is not the topic of this article.

The same methodology is used for items 1 and 2.

In essence, to confirm that you share a common ancestor with someone, you need to either:

Be a close relative – meaning you tested your mother and/or father and you match as expected. Or, you tested another known relative, like a first cousin, for example, and you also match as expected. These known relationships and matches become important in confirming or eliminating other matches and in mapping your own chromosomes to specific ancestors.
A triangulated match to at least two others who share the same distant ancestor. This happens when you match other people whose tree indicates that you share a common ancestor, but they are not previously known to you as family.

Triangulation is the only way you can prove that you do indeed share a common ancestor with someone not previously identified as family.

In essence, triangulation is the process by which you match people who match you genetically with common ancestors through their pedigree charts. I wrote about the process in this article “Triangulation for Autosomal DNA.”

To prove that you share a common ancestor with another individual, the DNA of three proven descendants of that common ancestor must match at the same location. I should add a little * to this and the small print would say, “ on relatively large segments.” That little * is rather controversial, and we’ll talk about that in a little bit. This leads us to the next step, which is if you’re a fourth person, and you match all three of those other people on that same segment, then you too share that common ancestor. This is the process by which adoptees and those who are searching for the identity of a parent work through their matches to work forward in time from common ancestors to, hopefully, identify candidates for individuals who could be their parents.

Why do we need to do this? Isn’t just matching our DNA and seeing a common ancestor in a pedigree chart with one person enough? No, it isn’t. I recently wrote about a situation where I had a match with someone and discovered that even though we didn’t know it, and still don’t know exactly how, we unquestionably share two different ancestral lines.

When you look at someone’s pedigree chart, you may see immediately that you share more than one ancestral line. Your shared DNA could come from either line, both lines, or neither line – meaning from an unidentified common ancestor. In genealogy parlance, those are known as brick walls!

Blaine Bettinger wrote about this scenario in his now classic article, “Everyone Has Two Family Trees – A Genealogical Tree and a Genetic Tree.”

Proving a Match

The only way to prove that you actually do share a genealogy relative with someone that is not a known family member is to triangulate. This means searching other matches with the same ancestral surname, preferably finding someone with the same proven ancestral tree, and confirming that the three of you not only share matching DNA, but all three share the same matching DNA segments. This means that you share the same ancestor.

Triangulation itself is a two-step process followed by a third step of mapping your own DNA so that you know where various segments came from. The first two triangulation steps are discovering that you match other people on a common segment(s) and then determining if the matches also match each other on those same segments.

Both Family Tree DNA and 23andMe, as vendors have provided ways to do most of this. www.gedmatch.com and www.dnagedcom.com both augment the vendor offerings. Ancestry provides no tools of this type – which is, of course, what has precipitated the chromosome browser war.

Let’s look at how the vendors products work in actual practice.

Family Tree DNA

1. Chromosome browser – do they match you?

Family Tree DNA makes it easy to see who you match in common with someone else in their matching tool, by utilizing the ICW crossed X icon.

In the above example, I am seeing who I match in common with my mother. Sure enough, our three known cousins are the closest matches, shown below.

You can then push up to 5 individuals through to the chromosome browser to see where they match the participant.

The following chromosome browser is an example of a 4 person match showing up on the Family Tree DNA chromosome browser.

This example shows known cousins matching. But this is exactly the same scenario you’re looking for when you are matching previously unknown cousins – the exact same technique.

In this example, I am the participant, so these matches are matches to me and my chromosome is the background chromosome displayed. I have switched from my mother’s side to known cousins on my father’s side.

The chromosome browser shows that these three cousins all match the person whose chromosomes are being shown (me, in this case), but it doesn’t tell you if they also match each other. With known cousins, it’s very unlikely (in my case) that someone would match me from my mother’s side, and someone from my father’s side, but when you’re working with unknown cousins, it’s certainly possible. If your parents are from the same core population, like Germans or an endogamous population, you may well have people who match you on both sides of your family. Simply put, you can’t assume they don’t.

It’s also possible that the match is a genuine genealogical match, but you don’t happen to match on the exact same segments, so the ancestor can’t yet be confirmed until more cousins sharing that same ancestral line are found who do match, and it’s possible that some segments could be IBS, identical by state, meaning matches by chance, especially small segments, below the match threshold.

2. Matrix – do they match each other?

Family Tree DNA also provides a tool called the Matrix where you can see if all of the people who match on the same segment, also match each other at some place on their DNA.

The Matrix tool measures the same level of DNA as the default chromosome browser, so in the situation I’m using for an example, there is no issue. However, if you drop the threshold of the match level, you may well, and in this case, you will, find matches well below the match threshold. They are shown as matches because they have at least one segment above the match threshold. If you don’t have at least one segment above the threshold, you’ll never see these smaller matches. Just to show you what I mean, this is the same four people, above, with the threshold lowered to 1cM. All those little confetti pieces of color are smaller matches.

At Family Tree DNA, the match threshold is about 7cM. Each of the vendors has a different threshold and a different way of calculating that threshold.

The only reason I mention this is because if you DON’T match with someone on the matrix, but you also show matches at smaller segments, understand that matrix is not reporting on those, so matrix matches are not negative proof, only positive indications – when you do match, both on the chromosome browser and utilizing the matrix tool.

What you do know at this point is that these individuals all match you on the same segments, and that they match each other someplace on their chromosomes, but what you don’t know is if they match each other on the same locations where they match you.

If you are lucky and your matches are cousins or experienced genetic genealogists and are willing to take a look at their accounts, they can tell you if they match the other people on the same segments where they match you – but that’s the only way to know unless they are willing to download their raw data file to GedMatch. At GedMatch, you can adjust the match thresholds to any level you wish and you can compare one-to-one kits to see where any two kits who have provided you with their kit number match each other.

3. Downloading data – mapping your chromosome.

The “download to Excel” function at Family Tree DNA, located just above the chromosome browser graphic, on the left, provides you with the matching data of the individuals shown on the chromosome browser with their actual segment data shown. (The download button on the right downloads all of your matches, not just the ones shown in the browser comparison.)

The spreadsheet below shows the downloaded data for these four individuals. You can see on chromosome 15 (yellow) there are three distinct segments that match (pink, yellow and blue,) which is exactly what is reflected on the graphic browser as well.

On the spreadsheet below, I’ve highlighted, in red, the segments which appeared on the original chromosome browser – so these are only the matches at or over the match threshold.

As you can see, there are 13 in total.

Smaller Segments

Up to this point, the process I’ve shared is widely accepted as the gold standard.

In the genetic genealogy community, there are very divergent opinions on how to treat segments below the match threshold, or below even 10cM. Some people “throw them away,” in essence, disregard them entirely. Before we look at a real life example, let’s talk about the challenges with small segments.

When smaller segments match, along with larger segments, I don’t delete them, throw them away, or disregard them. I believe that they are tools and each one carries a message for us. Those messages can be one of four things.

This is a valid IBD, meaning identical by descent, match where the segment has been passed from one specific ancestor to all of the people who match and can be utilized as such.
This is an IBS match, meaning identical by state, and is called that because we can’t yet identify the common ancestor, but there is one. So this is actually IBD but we can’t yet identify it as such. With more matches, we may well be able to identify it as IBD, but if we throw it away, we never get that chance. As larger data bases and more sophisticated software become available, these matches will fall into place.
This is an IBS match that is a false match, meaning the DNA segments that we receive from our father and mother just happen to align in a way that matches another person. Generally these are relatively easy to determine because the people you match won’t match each other. You also won’t tend to match other people with the same ancestral line, so they will tend to look like lone outliers on your match spreadsheets, but not always.
This is an IBS match that is population based. These are much more difficult to determine, because this is a segment that is found widely in a population. The key to determining these pileup areas, as discussed in the Ancestry article about their new phasing technique, if that you will find this same segment matching different proven lineages. This is the reason that Ancestry has implemented phasing – to identify and remove these match regions from your matches. Ancestry provided a graphic of my pileup areas, although they did not identify for me where on my chromosomes these pileup regions occurred. I do have some idea however, because I’ve found a couple of areas where I have matches from my mother’s side of the family from different ancestors – so these areas must be IBS on a population level. That does not, however, make them completely irrelevant.

The challenge, and problem, is where to make the cutoff when you’re eliminating match areas based on phased data. For example, I lost all of my Acadian matches at Ancestry. Of course, you would expect an endogamous population to share lots of the same DNA – and there are a huge number of Acadian descendants today – they are in fact a “population,” but those matches are (were) still useful to me.

I utilize Acadian matches from Family Tree DNA and 23andMe to label that part of my chromosome “Acadian” even if I can’t track it to a specific Acadian ancestor, yet. I do know from which of my mother’s ancestors it originated, her great-grandfather, who is her Acadian ancestor. Knowing that much is useful as well.

The same challenge exists for other endogamous groups – people with Jewish, Mennonite/Brethren/Amish, Native American and African American heritage searching for their mixed race roots arising from slavery. In fact, I’d go so far as to say that this problem exists for anyone looking for ancestors beyond the 5^th or 6^th generation, because segments inherited from those ancestors, if there are any, will probably be small and fall below the generally accepted match thresholds. The only way you will be able to find them, today, is the unlikely event that there is one larger segments, and it leads you on a search, like the case with Sarah Hickerson.

I want to be very clear – if you’re looking for only “sure thing” segments – then the larger the matching segment, the better the odds that it’s a sure thing, a positive, indisputable, noncontroversial match. However, if you’re looking for ancestors in the distant past, in the 5^th or 6^th generation or further, you’re not likely to find sure thing matches and you’ll have to work with smaller segments. It’s certainly preferable and easier to work with large matches, but it’s not always possible.

In the Ralph and Coop paper, The Geography of Recent Genetic Ancestry Across Europe, they indicated that people who matched on segments of 10cM or larger were more likely to have a common ancestor with in the past 500 years. Blocks of 4cM or larger were estimated to be from populations from 500-1500 years ago. However, we also know that there are indeed sticky segments that get passed intact from generation to generation, and also that some segments don’t get divided in a generation, they simply disappear and aren’t passed on at all. I wrote about this in my article titled, Generational Inheritance.

Another paper by Durand et al, Reducing pervasive false positive identical-by-descent segments detected by large-scale pedigree analysis, showed that 67% of the 2-4cM segments were false positives. Conversely, that also means that 33% of the 2-4cM segments were legitimate IBD segments.

Part of the disagreement within the genetic genealogy community is based on a difference in goals. People who are looking for the parents of adoptees are looking first and primarily as “sure thing” matches and the bigger the match segment, of course, the better because that means the people are related more closely in time. For them, smaller segments really are useless. However, for people who know their recent genealogy and are looking for those brick wall ancestors, several generations back in time, their only hope is utilizing those smaller segments. This not black and white but shades of grey. One size does not fit all. Nor is what we know today the end of the line. We learn every single day and many of our learning experiences are by working through our own unique genealogical situations – and sharing our discoveries.

On this next spreadsheet, you can see the smaller segments surrounding the larger segments – in other words, in the same match cluster – highlighted in green. These are the segments that would be discarded as invalid if you were drawing the line at the match threshold. Some people draw it even higher, at 10 cM. I’m not being critical of their methodology or saying they are wrong. It may well work best for them, but discarding small segments is not the only approach and other approaches do work, depending on the goals of the researcher. I want my 33% IBD segments, thank you very much.

All of the segments highlighted in purple match between at least three cousins. By checking the other cousins accounts, I can validate that they do all match each other as well, even though I can’t tell this through the Family Tree DNA matrix below the matching threshold. So, I’ve proven these are valid. We all received them from our common ancestor.

What about the white rows? Are those valid matches, from a common ancestor? We don’t have enough information to make that determination today.

Downloading my data, and confirming segments to this common ancestor allows me to map my own chromosomes. Now, I know that if someone matches me and any of these three cousins on chromosome 15, for example, between 33,335,760 and 58,455,135 – they are, whether they know it or not, descended from our common ancestral line.

In my opinion, I would think it a shame to discount or throw away all of these matches below 7cM, because you would be discounting 39 of your 52 total matches, or 75% of them. I would be more conservative in assigning my segments with only one cousin match to any ancestor, but I would certainly note the match and hope that if I added other cousins, that segment would be eventually proven as IBD.

I used positively known cousins in this example because there is no disputing the validity of these matches. They were known as cousins long before DNA testing.

Breaking Down Brick Walls

This is the same technique utilized to break down brick walls – and the more cousins you have tested, so that you can identify the maximum number of chromosome pieces of a particular ancestor – the better.

I used this same technique to identify Sarah Hickerson in my Thanksgiving Day article, utilizing these same cousins, plus several more.

Hey, just for fun, want to see what chromosome 15 looks like in this much larger sample???

In this case, we were trying to break down a brick wall. We needed to determine if Sarah Hickerson was the mother of Elijah Vannoy. All of the individuals in the left “Name” column are proven Vannoy cousins from Elijah, or in one case, William, from another child of Sarah Hickerson. The individuals in the right “Match” column are everyone in the cousin match group plus the people in green who are Hickerson/Higginson descendants. William, in green, is proven to descend from Sarah Hickerson and her husband, Daniel Vannoy.

The first part of chromosome 15 doesn’t overlap with the rest. Buster, David and I share another ancestral line as well, so the match in the non-red section of chromosome 15 may well be from that ancestral line. It becomes an obvious possibility, because none of the people who share the Vannoy/Hickerson/Higginson DNA are in that small match group.

All of the red colored cells do overlap with at least one other individual in that group and together they form a cluster. The yellow highlighted cells are the ones over the match threshold. The 6 Hickerson/Higginson descendants are scattered throughout this match group.

And yes, for those who are going to ask, there are many more Vannoy/Hickerson triangulated groups. This is just one of over 60 matching groups in total, some with matches well above the match threshold. But back to the chromosome browser wars!

23andMe

This example from 23andMe shows why it’s so very important to verify that your matches also match each other.

Blue and purple match segments are to two of the same cousins that I used in the comparison at Family Tree DNA, who are from my father’s side. Green is my first cousin from my mother’s side. Note that on chromosome 11, they both match me on a common segment. I know by working with them that they don’t match each other on that segment, so while they are both related to me, on chromosome 11, it’s not through the same ancestor. One is from my father’s side and one is from my mother’s side. If I hadn’t already known that, determining if they matched each other would be the acid test and would separate them into 2 groups.

23andMe provides you with a tool to see who your matches match that you match too. That’s a tongue twister.

In essence, you can select any individual, meaning you or anyone that you match, on the left hand side of this tool, and compare them to any 5 other people that you match. In my case above, I compared myself to my cousins, but if I want to know if my cousin on my mother’s side matches my two cousins on my father’s side, I simply select her name on the left and theirs on the right by using the drop down arrows.

I would show you the results, but it’s in essence a blank chromosome browser screen, because she doesn’t match either of them, anyplace, which tells me, if I didn’t already know, that these two matches are from different sides of my family.

However, in other situations, where I match my cousin Daryl, for example, as well as several other people on the same segment, I want to know how many of these people Daryl matches as well. I can enter Daryl’s name, with my name and their names in the group of 5, and compare. 23andMe facilitates the viewing or download of the results in a matrix as well, along with the segment data. You can also download your entire list of matches by requesting aggregated data through the link at the bottom of the screen above or the bottom of the chromosome display.

I find it cumbersome to enter each matches name in the search tool and then enter all of the other matches names as well. By utilizing the tools at www.dnagedcom.com, you can determine who your matches match as well, in common with you, in one spreadsheet. Here’s an example. Daryl in the chart below is my match, and this tool shows you who else she matches that I match as well, and the matching segments. This allows me to correlate my match with Gwen for example, to Daryl’s match to Gwen to see if they are on the same segments.

As you can see, Daryl and I both match Gwen on a common segment. On my own chromosome mapping spreadsheet, I match several other people as well at that location, at other vendors, but so far, we haven’t been able to find any common genealogy.

Ancestry.com

At Ancestry.com, I have exactly the opposite problem. I have lots of people I DNA match, and some with common genealogy, but no tools to prove the DNA match is to the common ancestor.

Hence, this is the crux of the chromosome browser wars. I’ve just showed you how and why we use chromosome browsers and tools to show if our matches match each other in addition to us and on which segments. I’ve also illustrated why. Neither 23andMe nor Family Tree DNA provides perfect tools, which is why we utilize both GedMatch and DNAGedcom, but they do provide tools. Ancestry provides no tools of this type.

At Ancestry, you have two kinds of genetic matches – ones without tree matches and ones with tree matches. Pedigree matching is a service that Ancestry provides that the other vendors don’t. Unfortunately, it also leads people to believe that because they match these people genetically and share a tree, that the tree shown is THE genetic match and it’s to the ancestor shown in the tree. In fact, if the tree is wrong, either your tree or their tree, and you match them genetically, you will show up as a pedigree match as well. Even if both pedigrees are right, that still doesn’t mean that your genetic match is through that ancestor.

How many bad trees are at Ancestry percentagewise? I don’t know, but it’s a constant complaint and there is absolutely nothing Ancestry can do about it. All they can do is utilize what they have, which is what their customers provide. And I’m glad they do. It does make the process of working through your matches much easier. It’s a starting point. DNA matches with trees that also match your pedigree are shown with Ancestry’s infamous shakey leaf.

In fact, in my Sarah Hickerson article, it was a shakey leaf match that initially clued me that there was something afoot – maybe. I had to shift to another platform (Family Tree DNA) to prove the match however, where I had tools and lots of known cousins.

At Ancestry, I now have about 3000 matches in total, and of those, I have 33 shakey leaves – or people with whom I also share an ancestor in our pedigree charts. A few of those are the same old known cousins, just as genealogy crazy as me, and they’ve tested at all 3 companies.

The fly in the ointment, right off the bat, is that I noticed in several of these matches that I ALSO share another ancestral line.

Now, the great news is that Ancestry shows you your surnames in common, and you can click on the surname and see the common individuals in both trees.

The bad news is that you have to notice and click to see that information, found in the lower left hand corner of this screen.

In this case, Cook is an entirely different line, not connected to the McKee line shown.

However, in this next case, we have the same individual entered in our software, but differently. It wasn’t close enough to connect as an ancestor, but close enough to note. It turns out that Sarah Cook is the mother of Fairwick Claxton, but her middle name was not Helloms, nor was her maiden name, although that is a long-standing misconception that was proven incorrect with her husband’s War of 1812 documents many years ago. Unfortunately, this misinformation is very widespread in trees on the internet.

Out of curiosity, and now I’m sorry I did this because it’s very disheartening – I looked to see what James Lee Claxton/Clarkson’s wife’s name was shown to be on the first page of Ancestry’s advanced search matches.

Despite extensive genealogical and DNA research, we don’t know who James Lee Claxton/Clarkson’s parents are, although we’ve disproven several possibilities, including the most popular candidate pre-DNA testing. However, James’ wife was positively Sarah Cook, as given by her, along with her father’s name, and by witnesses to their marriage provided when she applied for a War of 1812 pension and bounty land. I have the papers from the National Archives.

James Lee Claxton’s wife, Sara Cook is identified as follows in the first 50 Ancestry search entries.

Sarah Cook – 4

Incorrect entries:

Sarah Cook but with James’ parents listed – 3
Sarah Helloms Cook – 2, one with James’ parents
Sarah Hillhorns – 15
Sarah Cook Hitson – 13, some with various parents for James
No wife, but various parents listed for James – 12
No wife, no parents – 1

I’d much rather see no wife and no parents than incorrect information.

Judy Russell has expressed her concern about the effects of incorrect trees and DNA as well and we shared this concern with Ancestry during our meeting.

Ancestry themselves in their paper titled “Identifying groups of descendants using pedigrees and genetically inferred relationships in a large database” says, “”As with all analyses relating to DNA Circles™, tree quality is also an important caveat and limitation.” So Ancestry is aware, but they are trying to leverage and utilize one of their biggest assets, their trees.

This brings us to DNA Circles. I reviewed Ancestry’s new product release extensively in my Ancestry’s Better Mousetrap article. To recap briefly, Ancestry gathers your DNA matches together, and then looks for common ancestors in trees that are public using an intelligent ranking algorithm that takes into account:

The confidence that the match is due to recent genealogical history (versus a match due to older genealogical history or a false match entirely).
The confidence that the identified common recent ancestor represents the same person in both online pedigrees.
The confidence that the individuals have a match due to the shared ancestor in question as opposed to from another ancestor or from more distant genealogical history.

The key here is that Ancestry is looking for what they term “recent genealogical history.” In their paper they define this as 10 generations, but the beta version of DNA Circles only looks back 7 generations today. This was also reflected in their phasing paper, “Discovering IBD matches across a large, growing database.”

However, the unfortunate effect has been in many cases to eliminate matches, especially from endogamous groups. By way of example, I lost my Acadian matches in the Ancestry new product release. They would have been more than 7 generations back, and because they were endogamous, they may have “looked like” IBS segments, if IBS is defined at Ancestry as more than 7 or 10 generations back. Hopefully Ancestry will tweek this algorithm in future releases.

Ancestry, according to their paper, “Identifying groups of descendants using pedigrees and genetically inferred relationships in a large database,” then clusters these remaining matching individuals together in Circles based on their pedigree charts. You will match some of these people genetically, and some of them will not match you but will match each other. Again, according to the paper, “these confidence levels are calculated by the direct-line pedigree size, the number of shared ancestral couples and the generational depth of the shared MRCA couple.”

Ancestry notes that, “using the concordance of two independent pieces of information, meaning pedigree relationships and patterns of match sharing among a set of individuals, DNA Circles can serve as supporting evidence for documented pedigree lines.” Notice, Ancestry did NOT SAY proof. Nothing that Ancestry provides in their DNA product constitutes proof.

Ancestry continues by saying that Circles “opens the possibility for people to identify distant relatives with whom they do not share DNA directly but with whom they still have genetic evidence supporting the relationship.”

In other words, Ancestry is being very clear in this paper, which is provided on the DNA Circles page for anyone with Circles, that they are giving you a tool, not “the answer,” but one more piece of information that you can consider as evidence.

You can see in my Joel Vannoy circle that I match both of these people both genetically and on their tree.

We, in the genetic genealogy community, need proof. It certainly could be available, technically – because it is with other vendors and third party sites.

We need to be able to prove that our matches also match each other, and utilizing Ancestry’s tools, we can’t. We also can’t do this at Ancestry by utilizing third party tools, so we’re in essence, stuck.

We can either choose to believe, without substantiation, that we indeed share a common ancestor because we share DNA segments with them plus a pedigree chart from that common ancestor, or we can initiate a conversation with our match that leads to either or both of the following questions:

Have you or would you upload your raw data to GedMatch?
Have you or would you upload your raw data file to Family Tree DNA?

Let the begging begin!!!

The Problem

In a nutshell, the problem is that even if your Ancestry matches do reply and do upload their file to either Family Tree DNA or GedMatch or both, you are losing most of the potential information available, or that would be available, if Ancestry provided a chromosome browser and matrix type tool.

In other words, you’d have to convince all of your matches and then they would have to convince all of the matches in the circle that they match and you don’t to upload their files.

Given that, of the 44 private tree shakey leaf matches that I sent messages to about 2 weeks ago, asking only for them to tell me the identity of our common pedigree ancestor, so far 2 only of them have replied, the odds of getting an entire group of people to upload files is infinitesimal. You’d stand a better chance of winning the lottery.

One of the things Ancestry excels at is marketing.

If you’ve seen any of their ads, and they are everyplace, they focus on the “feel good” and they are certainly maximizing the warm fuzzy feelings at the holidays and missing those generations that have gone before us.

This is by no means a criticism, but it is why so many people do take the Ancestry DNA test. It’s advertised as easy and you’ll learn more about your family. And you do, no question – you learn about your ethnicity and you get a list of DNA matches, pedigree matches when possible and DNA Circles.

The list of what you don’t get is every bit as important, a chromosome browser and tools to see whether your matches also match each other. However, most of their customers will never know that.

Judging by the high percentage of inaccurate trees I found at Ancestry in my little experiment relative to the known and documented wife’s name of James Lee Claxton, which was 96%, based on just the first page of 50 search matches, it would appear that about 96% of Ancestry’s clientele are willing to believe something that someone else tells them without verification. I doubt that it matters whether that information is a tree or a DNA test where they are shown matches with common pedigree charts and circles. I don’t mean this to be critical of those people. We all began as novices and we need new people to become interested in both genealogy and DNA testing.

I suspect that most of Ancestry’s clients, especially new ones, simply don’t have a clue that there is a problem, let alone the magnitude and scope. How would they? They are just happy to find information about their ancestor. And as someone said to me once – “but there are so many of those trees (with a wrong wife’s name), how can they all be wrong?” Plus, the ads, at least some of them, certainly suggest that the DNA test grows your family tree for you.

The good news in all of this is that Ancestry’s widespread advertising has made DNA testing just part of the normal things that genealogists do. Their marketing expertise along with recent television programs have served to bring DNA testing into the limelight. The bad news is that if people test at Ancestry instead of at a vendor who provides tools, we, and they, lose the opportunity to utilize those results to their fullest potential. We, and they, lose any hope of proving an ancestor utilizing DNA. And let’s face it, DNA testing and genealogy is about collaboration. Having a DNA test that you don’t compare against others is pointless for genealogy purposes.

When a small group of bloggers and educators visited Ancestry in October, 2014, for what came to be called DNA Day, we discussed the chromosome browser and Ancestry’s plans for their new DNA Circles product, although it had not yet been named at that time. I wrote about that meeting, including the fact that we discussed the need for a chromosome browser ad nauseum. Needless to say, there was no agreement between the genetic genealogy community and the Ancestry folks.

When we discussed the situation with Ancestry they talked about privacy and those types of issues, which you can read about in detail in that article, but I suspect, strongly, that the real reason they aren’t keen on developing a chromosome browser lies in different areas.

Ancestry truly believes that people cannot understand and utilize a chromosome browser and the information it provides. They believe that people who do have access to chromosome browsers are interpreting the results incorrectly today.
They do not want to implement a complex feature for a small percentage of their users…the number bantered around informally was 5%…and I don’t know if that was an off-the-cuff number or based on market research. However, if you compare that number with the number of accurate versus inaccurate pedigree charts in my “James Claxton’s wife’s name” experiment, it’s very close…so I would say that the 5% number is probably close to accurate.
They do not want to increase their support burden trying to explain the results of a chromosome browser to the other 95%. Keep in mind the number of users you’re discussing. They said in their paper they had 500,000 DNA participants. I think it’s well over 700,000 today, and they clearly expect to hit 1 million in 2015. So if you utilize a range – 5% of their users are 25,000-50,000 and 95% of their users are 475,000-950,000.
Their clients have already paid their money for the test, as it is, and there is no financial incentive for Ancestry to invest in an add-on tool from which they generate no incremental revenue and do generate increased development and support costs. The only benefit to them is that we shut up!

So, the bottom line is that most of Ancestry’s clients don’t know or care about a chromosome browser. There are, however, a very noisy group of us who do.

Many of Ancestry’s clients who purchase the DNA test do so as an impulse purchase with very little, if any, understanding of what they are purchasing, what it can or will do for them, at Ancestry or anyplace else, for that matter.

Any serious genealogist who researched the autosomal testing products would not make Ancestry their only purchase, especially if they could only purchase one test. Many, if not most, serious genealogists have tested at all three companies in order to fish in different ponds and maximize their reach. I suspect that most of Ancestry’s customers are looking for simple and immediate answers, not tools and additional work.

The flip side of that, however, if that we are very aware of what we, the genetic genealogy industry needs, and why, and how frustratingly lacking Ancestry’s product is.

Company Focus

It’s easy for us as extremely passionate and focused consumers to forget that all three companies are for-profit corporations. Let’s take a brief look at their corporate focus, history and goals, because that tells a very big portion of the story. Every company is responsible first and foremost to their shareholders and owners to be profitable, as profitable as possible which means striking the perfect balance of investment and expenditure with frugality. In corporate America, everything has to be justified by ROI, or return on investment.

Family Tree DNA

Family Tree DNA was the first one of the companies to offer DNA testing and was formed in 1999 by Bennett Greenspan and Max Blankfeld, both still principles who run Family Tree DNA, now part of Gene by Gene, on a daily basis. Family Tree DNA’s focus is only on genetic genealogy and they have a wide variety of products that produce a spectrum of information including various Y DNA tests, mitochondrial, autosomal, and genetic traits. They are now the only commercial company to offer the Y STR and mitochondrial DNA tests, both very important tools for genetic genealogists, with a great deal of information to offer about our ancestors.

In April 2005, National Geographic’s Genographic project was announced in partnership with Family Tree DNA and IBM. The Genographic project, was scheduled to last for 5 years, but is now in its 9^th year. Family Tree DNA and National Geographic announced Geno 2.0 in July of 2012 with a newly designed chip that would test more than 12,000 locations on the Y chromosome, in addition to providing other information to participants.

The Genographic project provided a huge boost to genetic genealogy because it provided assurance of legitimacy and brought DNA testing into the living room of every family who subscribed to National Geographic magazine. Family Tree DNA’s partnership with National Geographic led to the tipping point where consumer DNA testing became mainstream.

In 2011 the founders expanded the company to include clinical genetics and a research arm by forming Gene by Gene. This allowed them, among other things, to bring their testing in house by expanding their laboratory facilities. They have continued to increase their product offerings to include sophisticated high end tests like the Big Y, introduced in 2013.

23andMe

23andMe is also privately held and began offering testing for medical and health information in November 2007, initially offering “estimates of predisposition for more than 90 traits ranging from baldness to blindness.” Their corporate focus has always been in the medical field, with aggregated customer data being studied by 23andMe and other researchers for various purposes.

In 2009, 23andMe began to offer the autosomal test for genealogists, the first company to provide this service. Even though, by today’s standards, it was very expensive, genetic genealogists flocked to take this test.

In 2013, after several years of back and forth with 23andMe ultimately failing to reply to the FDA, the FDA forced 23andMe to stop providing the medical results. Clients purchasing the 23andMe autosomal product since November of 2013 receive only ethnicity results and the genealogical matching services.

In 2014, 23andMe has been plagued by public relations issues and has not upgraded significantly nor provided additional tools for the genetic genealogy community, although they recently formed a liaison with My Heritage.

23andMe is clearly focused on genetics, but not primarily genetic genealogy, and their corporate focus during this last year in particular has been, I suspect, on how to survive, given the FDA action. If they steer clear of that landmine, I expect that we may see great things in the realm of personalized medicine from them in the future.

Genetic genealogy remains a way for them to attract people to increase their data base size for research purposes. Right now, until they can again begin providing health information, genetic genealogists are the only people purchasing the test, although 23andMe may have other revenue sources from the research end of the business

Ancestry.com

Ancestry.com is a privately held company. They were founded in the 1990s and have been through several ownership and organizational iterations, which you can read about in the wiki article about Ancestry.

During the last several years, Ancestry has purchased several other genealogy companies and is now the largest for-profit genealogy company in the world. That’s either wonderful or terrible, depending on your experiences and perspective.

Ancestry has had an on-again-off-again relationship with DNA testing since 2002, with more than one foray into DNA testing and subsequent withdrawal from DNA testing. If you are interested in the specifics, you can read about them in this article.

Ancestry’s goal, as it is with all companies, is profitability. However, they have given themselves a very large black eye in the genetic genealogy community by doing things that we consider to be civically irresponsible, like destroying the Y and mitochondrial DNA data bases. This still makes no sense, because while Ancestry spends money on one hand to acquire data bases and digitize existing records, on the other hand, they wiped out a data base containing tens of thousands of irreplaceable DNA records, which are genealogy records of a different type. This was discussed at DNA Day and the genetic genealogy community retains hope that Ancestry is reconsidering their decision.

Ancestry has been plagued by a history of missteps and mediocrity in their DNA products, beginning with their Y and mitochondrial DNA products and continuing with their autosomal product. Their first autosomal release included ethnicity results that gave many people very high percentages of Scandinavian heritage. Ancestry never acknowledged a problem and defended their product to the end…until the day when they announced an update titled….a whole new you. They are marketing geniuses. While many people found their updated product much more realistic, not everyone was happy. Judy Russell wrote a great summary of the situation.

It’s difficult, once a company has lost their credibility, for them to regain it.

I think Ancestry does a bang up job of what their primary corporate goal is….genealogy records and subscriptions for people to access those records. I’m a daily user. Today, with their acquisitions, it would be very difficult to be a serious genealogist without an Ancestry subscription….which is of course what their corporate goal has been.

Ancestry does an outstanding job of making everything look and appear easy. Their customer interface is intuitive and straightforward, for the most part. In fact, maybe they have made both genealogy and genetic genealogy look a little too easy. I say this tongue in cheek, full well knowing that the ease of use is how they attract so many people, and those are the same people who ultimately purchase the DNA tests – but the expectation of swabbing and the answer appearing is becoming a problem. I’m glad that Ancestry has brought DNA testing to so many people but this success makes tools like the chromosome browser/matrix that much more important – because there is so much genealogy information there just waiting to be revealed. I also feel that their level of success and visibility also visits upon them the responsibility for transparency and accuracy in setting expectations properly – from the beginning – with the ads. DNA testing does not “grow your tree” while you’re away.

I’m guessing Ancestry entered the DNA market again because they saw a way to sell an additional product, autosomal DNA testing, that would tie people’s trees together and provide customers with an additional tool, at an additional price, and give them yet another reason to remain subscribed every year. Nothing wrong with that either. For the owners, a very reasonable tactic to harness a captive data base whose ear you already have.

But Ancestry’s focus or priority is not now, and never has been, quality, nor genetic genealogy. Autosomal DNA testing is a tool for their clients, a revenue generation source for them, and that’s it. Again, not a criticism. Just the way it is.

In Summary

As I look at the corporate focus of the three players in this space, I see three companies who are indeed following their corporate focus and vision. That’s not a bad thing, unless the genetic genealogy community focus finds itself in conflict with the results of their corporate focus.

It’s no wonder that Family Tree DNA sponsors events like the International DNA Conference and works hand in hand with genealogists and project administrators. Their focus is and always has been genetic genealogy.

People do become very frustrated with Family Tree DNA from time to time, but just try to voice those frustrations to upper management at either 23andMe or Ancestry and see how far you get. My last helpdesk query to 23andMe submitted on October 24^th has yet to receive any reply. At Family Tree DNA, I e-mailed the project administrator liaison today, the Saturday after Thanksgiving, hoping for a response on Monday – but I received one just a couple hours later – on a holiday weekend.

In terms of the chromosome browser war – and that war is between the genetic genealogy community and Ancestry.com, I completely understand both positions.

The genetic genealogy community has been persistent, noisy, and united. Petitions have been created and signed and sent to Ancestry upper management. To my knowledge, confirmation of any communications surrounding this topic with the exception of Ancestry reaching out to the blogging and education community, has never been received.

This lack of acknowledgement and/or action on the issues at hand frustrates the community terribly and causes reams of rather pointed and very direct replies to Anna Swayne and other Ancestry employees who are charged with interfacing with the public. I actually feel sorry for Anna. She is a very nice person. If I were in her position, I’d certainly be looking for another job and letting someone else take the brunt of the dissatisfaction. You can read her articles here.

I also understand why Ancestry is doing what they are doing – meaning their decision to not create a chromosome browser/match matrix tool. It makes sense if you sit in their seat and now have to look at dealing with almost a million people who will wonder why they have to use a chromosome browser and or other tools when they expected their tree to grow while they were away.

I don’t like Ancestry’s position, even though I understand it, and I hope that we, as a community, can help justify the investment to Ancestry in some manner, because I fully believe that’s the only way we’ll ever get a chromosome browser/match matrix type tool. There has to be a financial benefit to Ancestry to invest the dollars and time into that development, as opposed to something else. It’s not like Ancestry has additional DNA products to sell to these people. The consumers have already spent their money on the only DNA product Ancestry offers, so there is no incentive there.

As long as Ancestry’s typical customer doesn’t know or care, I doubt that development of a chromosome browser will happen unless we, as a community, can, respectfully, be loud enough, long enough, like an irritating burr in their underwear that just won’t go away.

The Future

What we “know” and can do today with our genomes far surpasses what we could do or even dreamed we could do 10 years ago or even 5 or 2 years ago. We learn everyday.

Yes, there are a few warts and issues to iron out. I always hesitate to use words like “can’t,” “never” and “always” or to use other very strongly opinionated or inflexible words, because those words may well need to be eaten shortly.

There is so much more yet to be done, discovered and learned. We need to keep open minds and be willing to “unlearn” what we think we knew when new and better information comes along. That’s how scientific discovery works. We are on the frontier, the leading edge and yes, sometimes the bleeding edge. But what a wonderful place to be, to be able to contribute to discovery on a new frontier, our own genes and the keys to our ancestors held in our DNA.

______________________________________________________________

Disclosure

Thank you so much.

DNA Purchases and Free Transfers