The Autosomal Me – DNA Analysis – Splitting Up

DNA Analysis purchased 1-24-2013

This is Part 6 of a multi-part series.

Part 1 was “The Autosomal Me – Unraveling Minority Admixture” and Part 2 was “The Autosomal Me – The Ancestors Speak.”  Part 1 discussed the technique we are going to use to unravel minority ancestry, and why it works.  Part two gave an example of the power of fragmented chromosomal mapping and the beauty of the results.

Part 3, “The Autosomal Me – Who Am I?,” reviewed using our pedigree charts to gauge expected results and how autosomal results are put into population buckets.  Part 4, “The Autosomal Me – Testing Company Results,” shows what to expect from all of the major testing companies, past and present, along with Dr. Doug McDonald’s analysis.  In Part 5, “The Autosomal Me – Rooting Around in the Weeds Using Third Party Tools,” we looked at 5 different third party tools and what they can tell us about our minority admixture that is not reported by the major testing companies because the segments are too small and fragmented.

In this segment, Part 6, “DNA Analysis – Splitting Up” we’re going to focus on specific aspects of those tools and begin our analysis of our minority ancestry.

Analysis.  Sounds like I’m climbing on the shrink’s couch.  But I’m not, I’m saving all my dollars for DNA kits!  Besides, I don’t want to stop!  This analysis, we’ll do by putting several pieces of data together and sorting the wheat from the chaff.  And yes, we’ll be splitting up…well…splitting our DNA up into pieces contributed by our father and mother.

Let’s start with looking at the DNA segments that mother and I share that are Native.

According to Doug McDonald, we have significant Native matches on chromosomes 1 and 2, with third party tools confirm that finding.  Unfortunately, the only company where Mom’s DNA resides is Family Tree DNA whose test did not reveal the Native ancestry.  23andMe did confirm Native segments in my DNA in those locations.

I’ve used several third party tools at GedMatch to see where Mom and I both have Native heritage, where she has it and I don’t, and equally as important, where I have it and she doesn’t?  What is that so important?  Simple, it means my father had Native heritage too, and tells me on which chromosomes his Native DNA is located  This could, when matching people in the future, on particular segments, help to isolate who our common Native ancestor was, or at least which line.  That is the ultimate goal we are working towards with this entire process.

In this case, to identify my father’s Native lines, if Mom and I neither or both have Native markers at a particular chromosome location, the values are irrelevant, because the Native lineage came from mother.  I did notice in a few cases that I had more than mother, and of course, in that situation, it means that my father contributed some too, or my mother had a misread in that region or a categorization issue exists.  For that reason, I am looking for patterns, not single instances.  We’ll discuss using patterns in a future segment.

Using the MDLP chromosome mapping tool, as MDLP appears to be the most comprehensive, I created a spreadsheet using my results as a base.  I then added mother’s values in the spaces where I had no values, and then I highlighted my results in the locations where mother had no value.  The essence of this is that the red, bold, underscore values mean Mom had a Native result here, but I didn’t receive it.  A yellow highlighted cell means I got the entire amount from my father, because my mother has no percentage showing.  In other cases, of course, it’s possible that both mother and father contributed Native ancestry on some adjacent chromosome segments.  The MDLP mapping tool with my additions is shown below for chromosomes one through eight.  Chromosomes 9-22 are similar, but the chart is too big to display as a whole.  This provides an example of how to do this analysis with your own results.

MDLP Chromosome Map Table

The results were very interesting.

My two primary regions, North-East-Europe and Atlantic-Mediterranean-Neolithic, were represented on every chromosome for both my mother and myself.  No surprises there.  The other regions would be considered minority admixture.

In 2 categories, North-European-Mesolithic and East Siberian, only my father contributed genetic material on some chromosomes and there were no chromosomes where my mother alone contributed.

In 1 category, Melanesia, only my mother contributed genetic material on some chromosomes and there were no chromosomes where my father alone contributed.

In all other categories, both parents contributed on some chromosomes where the other didn’t.  This is important, because it will allow me to associate a match with a particular segment of a chromosome on a particular parent’s side with Native ancestry.

In the minority categories for Native American, Mesoamerican, Arctic-Amerind, South America Amerind and North Amerind, grouped together, both parents contributed on some chromosomes where the other didn’t, and in two categories, on 3 chromosomes, I carry more than my mother, indicating an additional contribution from my father.

This is a repeated occurrence, with Native ancestry for my parents and I combined showing on a total of 42 chromosome locations across 4 geographic/ethnic categories, and in at least three cases, both parents contributed.

In the African categories, South African, Sub-Saharan and Pygmy, I had contributions from both parents on a combined total of 18 chromosome segments.  The African admixture, in total, was less than the Native, and they are assuredly below 5% combined.  If they were present at higher levels, I wouldn’t need to go through these genetic gyrations to prove or disprove the heritage and which parent contributed, because it would be evident in the testing results of all companies.

In our next segment, Step 7, we will be further scrutinizing Chromosomes 1 and 2 for additional information about Native heritage and assigning specific Native segments that I carry on various chromosomes  to either my mother or father’s lineage.

______________________________________________________________

Disclosure

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Services

Genealogy Research

The Autosomal Me – Rooting Around in the Weeds Using Third Party Tools

This is Part 5 of a series.

Part 1 was “The Autosomal Me – Unraveling Minority Admixture” and Part 2 was “The Autosomal Me – The Ancestors Speak.”  Part 1 discussed the technique we are going to use to unravel minority ancestry, and why it works.  Part two gave an example of the power of fragmented chromosomal mapping and the beauty of the results.  Part 3, “The Autosomal Me – Who Am I?,” reviewed using our pedigree charts to gauge expected results and how autosomal results are put into population buckets.  Part 4, “The Autosomal Me – Testing Company Results,” shows what to expect from all of the major testing companies, past and present, along with Dr. Doug McDonald’s analysis.

In this segment, Part 5, we’re going to look at various third party tools and what they can do for our search for minority admixture.  We will use the download files from either 23andMe and Family Tree DNA and utilize third party tools to analyze the raw data.  We’ll see how third party developers put those puzzle pieces together, if the results are consistent and what they tell us.

The Weeds

When dealing with testing companies, particularly any individual source (as opposed to multiple testing company results, as I have done), minority admixture, especially less than 1% may not be successfully recognized.  One percent equates to between 6 and 7 generations or about to the 1800 threshold in time.  However the history of both African and Native admixture in colonial America goes back another 200 years to the Jamestown era.

The social history in the US means that there are many people looking for this admixed heritage as long ago as 1609 when Jamestown was established and the first European/Native marriages took place (although there were “blonde Indians” reported by Jamestown settlers).  In round numbers, that’s about 400 years or between 13 and 16 generations.  Of course, a minority ancestor drops below the 1% threshold between 7 and 8 generations (with the first generation being the person tested) and by the time you get to the 12th generation, you’re at .048%.  At this level, Bennett Greenspan says we’re “rooting around in the weeds,” and he’s right.

However, rooting around in the weeds for those dreaded IBS (Identical by State) segments in genealogy is exactly what we need when looking for small amounts of minority admixture.  What’s an IBS segment you ask?  It’s a segment that is typically too small to be counted as an IBD, or identical by descent, segment.  IBS means that you’re from a common population if you match someone with a very small segment, not necessarily that you share a common ancestor within the past several generations.  But how to you tell if a small segment is IBS or IBD?

There is no absolute line in the sand, but often segments smaller than 7cM (centimorgans) or 700 SNPS (some say 5cM and 500 SNPs) fall into the IBS category.  This has caused some researchers to discard all segments of this size because they can’t tell the difference.  That’s unfortunate, because clearly some of these segments are IBD and the IBS segments can be useful too.

When looking for minority admixture in two people, both of them having these small segments in the same location can provide meaningful information, and can confirm minority heritage.  Said another way, if two people have less than 1% Native heritage, both share a common ancestor, and both carry part of their “less than 1%” on the same segment….one might say it’s not likely to be coincidence.  Identifying the common segments of your common ancestor can lead to identifying the specific family line those segments came from, especially if you match others as well.  This is in essence what Minority Admixture Mapping, or MAP, does.  It uses these techniques to look for patterns in these small fragmented pieces that, when taken together, indicate minority heritage.  Having said that, some IBS segments will indeed, be simply that, because you share the same base population, but some will be IBD, or more current in time.  With the MAP technique, we’re sorting through ways to utilize these small segments, whether they are IBS or IBD.

Using the tools, MDLP, Eurogenes, Dodecad and HarrappaWorld at GedMatch allows us to “root around in the weeds,” to quote Bennett, and find those all-important small IBS/IBD segments that connect us to a particular ethnicity and ultimately, to other relatives who carry these same segments in the same locations.

In general, using these this type of DNA is called BGA, or Biogeographical Ancestry where we use SNPs of autosomal DNA called AIMs, Ancestry Informative Markers.  A SNP is a Single Nucleotide Polymorphism, or a mutation that happened in one specific location on a gene.  AIMs are generally SNPs, not clusters of markers, found at different frequencies in different populations.  We combine all we know about them scientifically with information about population frequencies and then draw inferences about where our ancestors came from based on that information.  So a SNP that is useful in determining ancestry is called an AIM.

These SNPs, or AIMs, are the foundation for these BGA tools that we will be using to sort through small segments of minority admixture.  So this is a building block process.  Scientists identify SNPs found in different populations at different frequencies and identify them as such, then scientists and genetic genealogists create BGA tools that use and combine SNPs/AIMs to suggest populations and ethnicities for those who carry them.  Using these tools, majority ancestry is easy to discern.  We’re going to use those tools to look at groups of SNPs/AIMs clustered in small, fragmented IBS or IBD segments to do Minority Admixture Mapping (MAP) to confirm our minority admixture and to identify our minority admixed lines, families and perhaps even (in time) our original minority ancestor.

I bet you thought I couldn’t fit all of those acronyms in one paragraph, but I did:)  It is a bit like alphabet soup, but when you understand that this is a building process, it’s much easier to grasp as a whole.

Having at least one parents DNA makes this process much easier, because you can immediately tell if your other parent, by inferrence or process of elimination, has contributed any of the minority ancestry, or if it’s all on one side of the tree.  Of course, that’s assuming your parents aren’t related to each other.  There’s a test for that too at GedMatch.  If you don’t have one parent available, you can “make do” with aunts, uncles and cousins, but it’s a much more tedious process.

Third Party Tools

To use any of these BGA tools, you’ll need to download your results from either 23andMe, Family Tree DNA or National Geographic.  Currently at GedMatch, the only supported formats are 23andMe or Family Tree DNA, because the National Geographic test is so new.  I used my Family Finder (Illumina Build 36) raw data file.

To download your results from 23andMe, sign on to your account, then click on this link and it will take you to the area to download your results.

https://www.23andme.com/you/explorer/

Save the file and do not open it as the act of opening it sometimes causes corruption and you will have a hard time uploading the file.  If the upload fails, download a new copy and start over.  If you have an older copy on your computer, it’s always a good idea to use a fresh copy to incorporate any changes made by the vendor since your last file download.

To download your results from Family Tree DNA, sign on to your personal page, click on the Family Finder tab and then on “Download Raw Data.”  As I write this, Family Tree DNA is in the midst of a conversion from Build 36 to Build 37 for their autosomal files (in order to facilitate the integration of 23andMe results), so you may need to be a bit patient while this process completes.  Files may not be available for download at some points.  You certainly don’t want to mix comparisons, meaning using one build 36 and one build 37 file for comparison.

If you’re following this process yourself with your own data, please read all the way through this posting before starting your own processing.

Now, let’s look at the third party tools.

Stanford University

This tool is available at Stanford University.  Scientists have collaborated to provide this service and I think it’s quite interesting.  This tool is not compatible with any browser except Chrome and it requires a download of your autosomal data in a .txt file.  If it can’t load your file, the loading task simply never completes.  For me, that meant it wasn’t a .txt file I was trying to load.

http://esquilax.stanford.edu/

Load your file and choose Ancestry, then Paintings, then Hap Map 3 (experimental), then Paint my Chromosomes.

weeds 1

Their legend, above, translates to the regions, below.

ASW – African ancestry in Southwest USA

CHD – Chinese in Metropolitan Denver, Colorado

GIH – Gujarati Indians in Houston, Texas

LWK – Luhya in Webuye, Kenya

MEX – Mexican Ancestry in Los Angeles

TSI – Toscani in Italia

weeds2

Unfortunately, this isn’t terribly useful.  Hap Map 3 utilizes additional regions, including Utah, but this tool doesn’t seem to be mapping them, so my closest match region is Italy, which is midleading since none of my family was from Italy.  Hap Map 2 is also an option which does include the Utah population, but it’s not as up to date otherwise as Hap Map 3.

David Pike has figured out how to tweak these settings some.  You can read about it at this link:  https://www.23andme.com/you/community/thread/8062/.  David’s posting on June 20th shows what he did.  However, compared to the other tools available, I find this a poor choice and did not spend a lot of time trying to work with it.

However, a second feature that they provide is fun.

Stanford provides a Neanderthal tool that’s a little different than the Nat Geo or 23andMe ones.  Click on Explore, Neanderthal, Look Up Exercise.  Then enter your primary ethnicity and click on Look Up Exercise again.

Of a possible 84 Neanderthal alleles, I have 9, partially displayed below.

weeds3

GedMatch

www.Gedmatch.com is a complimentary (voluntary contribution) site created by two genetic genealogists that includes several autosomal analysis tools.  One of the areas of this site is “Admix Tools.”  On that page one finds several private or proprietary tools, some written by genetic genealogists, some by researchers, and all free.  Let’s take a look at each one and their results.  If you want to see any of the results more closely than the photos here allow, you can run each of the comparisons using kit F6656 (mine) as the first kit and kit F9141 (my mother) as the second kit.

Each of these tools offers the same functionality, as follows.

weeds 4

We will be utilizing 4 of these functions for each tool.

  • Admixture Proportions
  • Admixture Proportions by Chromosome
  • Chromosome Painting
  • Paint Differences between 2 kits, 1 chromosome

We select from the tools as follows:

weeds 5

Let’s take a look at what the tools provide.

MDLP World 22

The MDLP software is sponsored by two genetic genealogists.  You can read more about the project at http://magnusducatus.blogspot.com/ and http://magnusducatus.blogspot.com/2012/09/behind-curtains-mdlp-world-22-showcase.html.

weeds 6

MDLP shows several populations.  I was interested to see if my mother also shared the African percentage.  Interestingly, mother does have a South African segment, but it’s .12, so less than mine.  Therefore, I would have obtained part of my African heritage from my father.  She also has three different categories of Native American heritage, compared to my one.  She carried a total of 1.92% and I carry .58%.  Otherwise, our results are very similar.

weeds 7

The next feature is ethnicity mapping by chromosome.  While the display is too large to see well it’s interesting to note that indeed, both Native American and African were detected on several chromosomes, not just on chromosomes 1 and 2 as reported by 23andMe and Dr. McDonald.  Note that DeCode Genetics showed “East Asian” admixture on several chromosomes.

weeds 8

Here’s a portion of the above chart that you can actually see.  The highlighted blue regions are your major ethnic regions.

weeds 9

Another feature is chromosome painting, shown below.  This shows the first part of my chromosomes 1 and 2 painted by ethnic/regional breakdown.  The legend for each tool is different and above their graph.

weeds 10

weeds 11

These tools also provide the ability to compare one chromosome between two people.  On the graph below, my chromosome 1 is on the top, and my mother’s is second, with the third band being our common painting.  The black represents non-shared regions, meaning those contributed to me by my father.  Unfortunately, North American Native American is dark grey, sometimes difficult to distinguish from black.

weeds 12

The graph below shows that while I do share a large piece of Chromosome 1’s Native region (about 160-180mb) with my mother, there are also segments, 169-170 for example, where I have Native genes that she does not, indicating Native heritage in this location from my father’s side.

weeds 13

Eurogenes K9

Eurogenes was created by another genetic genealogist.  You can read more about it at http://bga101.blogspot.com.au/2012/04/eurogenes-admixture-utilities-at.html.

weeds 14

Eurogenes calls me primarily North European with .67 Native American and no African in the percentages above, but below, on the individual chromosomes, some African does show, although not on as many chromosomes as MDLP.

weeds 15

In the charts above and below, you can see that Eurogenes detected small amounts of African along with Native American.

weeds 16

weeds 17

Notice that at about 10mb on chromosome 1, on the graph below in the top band, that the North American Indian (yellow) and the South Asian (red) are imbedded with each other.  These appear again together at the beginning of chromosome 2, shown as the second band.  This hints at how and why it’s sometimes so difficult to determine and filter Native American from Asian.  There is no line in the sand, there is a continuum between populations, the only differentiator being 10,000 to 15,000 years spent apart in which time, they, hopefully, developed enough differentiating mutations that we can tell them apart.

weeds 18

On the chart below, the top band shows the chromosome painting of my chromosome, and the second band shows the chromosome 1 Native American segment (about 160-180 mb) of my mother with the third band showing both matching and non-matching regions, painted black.  Looking at the segment of chromosome 1, in the graph below, characerized as Native, we can see in mine, top row, that this is categorized as Native American (yellow), but some of the same regions below, in Moms are categorized as South Asian (red), causing a technical non-match, when in reality, It’s likely a categorization issue, not a genetic mismatch.  In future analysis, we’ll be using two methods of comparison, one called “Strong Native” that only matches Native to Native and another, the “Blended Asian” method that allows for grouping of similar ancestral types that together likely indicate a Native heritage.

weeds 19

Dodecad V3

Dodecad was created by an anthropologist.  You can read more about it at http://dodecad.blogspot.com/ and http://dodecad.blogspot.com/2011/06/design-of-dodecad-v3.html.

weeds 20

Dodecad, unfortunately, does not subdivide into Native American, so the Native will show here as some form of Asian.  Northwest Africa shows in the percentages above, but more detailed African heritage shows in the chromosome detail below in regions not shown above.

weeds 21

weeds 22

weeds 23

weeds 24

Above, my chromosome painting for the first part of chromosomes 1 and 2.

Below, the comparison showing the Native segments from about 160-180mb.   My Native segment (top) compared to mother’s (middle) with the comparison of the two on the bottom for chromosome 1.

weeds 25

HarappaWorld

HarappaWorld divides results into fewer population groups and is focused on Asia.  You can read more about it at http://www.harappadna.org/2012/05/diy-harappaworld/.

weeds 26

In HarrappaWorld, Beringian and American appear to be equivalent to Native American.  Like Dodecad and Eurogenes, African does not show in the total percentages, but does on the individual chromosome analysis, although in smaller percentages with this application.

weeds 27

weeds 28

weeds 29

Chromosome painting of my chromosomes 1 and 2 are shown below.

weeds 30

The graphs below show the Native region comparison of chromosome 1 between me, top row, mother, middle row, and the third graph showing the common areas, with black representing areas where there is no match.

weeds 31

For each of these tools and their results, we’ll do further analysis in a future segment of this series.

Tools Summary

Now that we’ve looked at these individual tools,  and building on the Test Results Chart created in Parts 3 and 4, let’s compare and see what information these tools add.

Test Results Chart Including Third Party Tools

Test/Company European Asian Native African Unknown
Pedigree Analysis

75%

0

~1%

0

24%

Testing Companies
Family Tree   DNA – Original

100%[1]

0

0

0

deCodeme

92%

5%

Inferred[2]

3%

deCodeme –   X

91%

6%

Inferred

3%

Dr.   McDonald

97-99%

1-3%

0.5%

0

23andMe –   Original

99%

1%

Inferred[3]

0

0

23andMe –   2012 – Standard

99.2%[4]

0

.5%

0

.3%

23andMe –   2012 – Conservative

98.7%[5]

0

.3%

0

1%

23andMe –   2012 – Speculative

99.3%[6]

0

.5%

0

.2%

Family Tree   DNA – 2012

100%[7]

Geno 2.0

79%[8]

18%

0

0

0

Ancestry

92%[9]

0

0

0

8%

Third Party Tools
MDLP

86.68%

12.55%

.58%

.17%

0

Eurogenes

94.83%

4.5%

.67%

0

0

Dodecad

85.47%

13.43%

Inferred

1.09%

0

HarrappaWorld

86.56%

12.80%

.65%

0

0

Of the various chromosomes, the breakdown is as follows. Dodecad does not break the categories in a comparable fashion to these other 3 tools, so their results are omitted in the following chart.  Please note that how geographies are categorized can make a significant difference.

Minority by Chromosome Chart

Tool/Chr MDLP Native Eurogenes Native Harrappa Native MDLP African Eurogenes African Harrappa African
1 Y Y Y N N N
2 Y Y Y Y Y N
3 N N N Y Y N
4 Y N Y N N N
5 N N N N N N
6 Y Y Y Y N N
7 N N Y N N N
8 Y Y Y Y N Y
9 Y N N Y N N
10 Y N N Y N N
11 Y N Y Y N N
12 Y N Y N N N
13 Y N Y N N N
14 Y Y Y Y N N
15 Y N N N N Y
16 Y Y Y Y N N
17 Y Y Y N N N
18 N N N N N N
19 Y Y Y Y N N
20 Y Y Y Y N N
21 Y N Y N N N
22 N N N Y Y N

From these various tools, it’s obvious that I do have some Native admixture, probably about 1%, and it’s from both parents.  I also have some African, but it looks to be an even smaller proportion that Native American.

Join me for Part 6 of this series, where we look at how to analyze and use this information.


[1] 71.5% western European, 28.4% Northeastern European

[2] Inferred that Asian is actually Native in an American with no history of Asian ancestry.

[3] No category, inferred.

[4] 78.6% Northern European, 1.8% Southern European, 18.7% Nonspecific European

[5] 54.6% Northern European, .3% Southern European, 43% Nonspecific European

[6] 91.7% Northern European, 3% Southern European, 3.3% Nonspecific European

[7] 75.18% West Europe (French and Orcadian), 24.82 Europe (Romanian, Russian, Tuscan and Finnish).  Note that my mother’s results are almost identical except the Finnish is missing from hers.

[8] 43% North Europe and 36% Mediterranean

[9] 80% British, 12% Scandinavian

______________________________________________________________

Disclosure

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Services

Genealogy Research

2012 Top 10 Genetic Genealogy Happenings

2012 has been a very busy year for genetic genealogists.  There have been lots of discoveries and announcements that affect everyone, now and in the future.  The watchwords for 2012 would be “churn” and “explosive growth.”  Let’s take a look at the 10 most important events, why they are important and what they mean for the future of genetic genealogy.

These items are in what I think are relatively good order, ranked by their importance, although I had a very difficult time deciding between number 1 and 2.

1. The New Root – Haplogroup A00

At the Family Tree DNA conference in November, Michael Hammer, Bonnie Schrack and Thomas Krahn announced that they had made a monumental discovery in the age of modern man known as Y-line Adam.  The discovery of Haplogroup A00 pushes the “birth” of mankind back from about 140,000 years ago to an amazing 338,000 years ago.  Utterly amazing.  The DNA came from an American family from South Carolina.  This discovery highlights the importance of citizen science.  Bonnie is a haplogroup administrator who recognized the potential importance of one of her participants’ DNA.  Thomas Krahn of course is with Family Tree DNA and ran the WTY test, and Michael Hammer is at the University of Arizona.  So you have the perfect blend here of participant, citizen scientist, commercial lab and academia.  What was never thought possible a decade or so ago is not only working, it’s working well and changing the face of both science and humanity.

http://dna-explained.com/2012/11/16/the-new-root-haplogroup-a00/

http://www.haplogroup-a.com./

2. Geno 2.0

Geno 2.0 is the Nickname for the National Geographic Society’s Genographic Project version 2.0.  That mouthful is why it has a nickname.

This amazing project has leveraged the results of the past 7 years of research from the original Genographic project into a new groundbreaking product.  Geno 2.0, utilizing the GenoChip, a sequencing chip created specifically for Nat Geo, offers the most complete Y tree in the world today, expanding the SNP tree from just over 800 SNPs to over 12,000.  They are in essence redrawing the Y chromosome tree as I write this.  In addition, the person who purchases Geno 2.0 will receive a mitochondrial DNA haplogroup assignment.  Over 3300 new mitochondrial mutations were discovered. A brand new anthropological “percentages of ethnicity” report is featured based on over 75,000 Ancestry Informative Markers, many only recently discovered by the Genographic project.  Additionally, participants will receive their percentage of both Neanderthal and Denisovan ancestry based on 30,000 SNPs identified that signal interbreeding between the hominids.  A new website will also facilitate social networking and uploading information to Family Tree DNA.

The wonderful news is that there is a massive amount of new information here that will change the landscape of genetic genealogy.  The difficulty is that we are struggling a bit under the load of that massive amount of information that is just beginning to descend upon us.  It’s a great problem to have!

http://dna-explained.com/2012/07/25/national-geographic-geno-2-0-announcement-the-human-story/

http://dna-explained.com/2012/07/26/geno-2-0-qa-with-bennett-greenspan/

http://dna-explained.com/2012/07/30/geno-2-0-answers-from-spencer-wells/

http://dna-explained.com/2012/07/31/geno-2-0-wty-mtdna-full-sequence-participants-and-more/

http://dna-explained.com/2012/10/14/what-to-order-geno-2-0-vs-family-tree-dna-products/

http://dna-explained.com/2012/10/16/geno-2-0-the-kit-arrives/

http://dna-explained.com/2012/12/11/geno-2-0-results-first-peek/

http://dna-explained.com/2012/12/12/geno-2-0-results-kicking-the-tires/

3. Reconstructed Sapiens Reference Sequence (RSRS)

In July, Family Tree DNA implemented the RSRS that in effect reconstructs the genetic profile of Mitochondrial Eve and bases the comparison of our DNA today against the RSRS sequence as opposed to the Cambridge Reference Sequence (rCRS) created in 1981 that is or was the current standard.  The RSRS is a result of the watershed paper published in April 2012 by Dr. Doron Behar and 8 other authors titled “A “Copernican” Reassessment of the Human Mitochondrial DNA Tree from its Root.”  A complementary research website, www.mtdnacommunity.org, accompanies the paper.

http://dna-explained.com/2012/07/14/what-happened-to-my-mitochondrial-dna/

http://dna-explained.com/2012/07/15/the-crs-and-the-rsrs/

http://dna-explained.com/2012/07/16/the-mtdna-community/

http://dna-explained.com/2012/12/02/little-a-big-a-mitochondrial-dna/

4. Full Genome and Exome Sequence Offered Commercially by Gene by Gene

It was announced at the November DNA conference that Gene by Gene, the parent company of Family Tree DNA, through their division titled DNA DTC is offering full genomic sequencing for the amazing price of $5495 for the full genome and $695 for the exome.  This is a first in the consumer marketspace.  Today, this doesn’t have a lot of application for genetic genealogy, but as the price continues to drop, and utilities are built to process the full genomic data, certainly a market and applications will emerge.  This is an important step forward in the industry with a product that still cost 3 million dollars in 2007.

http://dna-explained.com/2012/11/30/gene-by-gene-announces-landmark-dna-dtc-full-genome-sequence/

5. Neanderthal and Denisovan DNA

It’s official – they did it.  Yep, they interbred and well, they are not them anymore, they are us.  Given that everyone in Asia and Europe carries a part of them, but not people from Africa, it would appear that two populations admixed rather thoroughly in Eurasia and/or the populations were small.  The amount of Neanderthal and Denisovan DNA will continue at approximately the proportions seen today in Europe (2% Neanderthal) and Asia unless a significant amount of admixture from a population (Africa) that does not carry this admixture is introduced.  So if you’re European, you carry both Neanderthal and Denisovan DNA.  They are your ancestors.  The good news is that you can find how much of each through  the Geno 2.0 test.  23andMe results give you the percentage of Neanderthal, but not Denisovan.

http://dna-explained.com/2012/08/31/denisovan-dna-tells-a-story/

http://dna-explained.com/2012/12/12/geno-2-0-results-kicking-the-tires/

6. Ancestral Genome Reconstruction Begins,  Led by Falling Autosomal Prices and the Ability to Fish in Multiple Ponds

2012 has been the year of autosomal testing price reductions and a great deal of churn in this marketspace.  Companies are playing leap-frog with one another.  However, sometimes things are not all that they seem.

Initially, 23andMe opted for an initial payment plus monthly subscription model, which they abandoned for a one time payment price of $299 in early 2012.  Family Tree DNA was slightly less, at $289.

Ancestry led the price war by giving away kits, then selling them for $99, then $129 plus a subscription as an entrance into this market.  However, looking at the Ancestry consent form hints at possible reasons why they were selling below the cost of the tests.  You are in essence giving them permission to sell your DNA and associated information.  In addition, to gain full access to your results and matches, you must maintain some level of subscription to Ancestry.com, increasing the total effective price.

Next came Family Tree DNA’s sale where they dropped their autosomal price to $199, but they were shortly upstaged by 23andMe whose price has now dropped to $99 permanently, apparently, a result of a 50 million dollar investment in order to reach 1 million customers.  They currently have about 180,000.  23andMe has always been in the medical/health business, so their clients have always understood what they were consenting to and for.

Not to be outdone, Family Tree DNA introduced the ability earlier in 2012 to upload your data files from 23andMe to FamilyTree DNA for $89, far less than a second test, which allows you to fish in a second pond where genealogists live for matches.  The challenge at 23andMe is that most of their clients test for the health traits and either don’t answer inquiries or match requests, or know little about their genealogy if they do.  At Family Tree DNA, matches don’t have to answer and allow a match, testers are automatically matched with all participants who take the Family Finder test (or upload their 23andMe results) and testers are provided with their matches’ e-mail address.

Of course, Geno 2.0 was also introduced in the midst of this, in July, for $199 with the additional lollipop of new SNPS, lots of them, that others simply don’t have access to yet.

The good news is that consumers have benefitted from this leapfrogging, I think.  Let’s hope that the subsidized tests at Ancestry and 23andMe don’t serve long term to water down the demand to the point where unsubsidized companies (who don’t selling participants genetic results to others) have problems remaining viable.

Personally, I’ve tested at all of these companies.  I’ll be evaluating the results shortly in detail on my blog at www.dna-explained.com.

The tools provided by most testing companies, plus GedMatch, and multiple ponds to fish in are allowing the serious genetic genealogist to “reconstruct” their genome, attributing segments to specific ancestors.  Conversely, we will also be able to “reconstruct” specific ancestral family lines as well by identifying autosomal segments in multiple descendants.  This new vision of autosomal genetic genealogy will allow much more accurate ancestral line matching, and ancestor identification in the not-so-distant future.

http://www.yourgeneticgenealogist.com/2012/01/family-tree-dna-now-accepting-23andme.html

http://www.yourgeneticgenealogist.com/2012/05/23andme-eliminates-subscription-model.html

http://www.yourgeneticgenealogist.com/2012/10/clarification-of-what-is-available-to.html

http://www.yourgeneticgenealogist.com/2012/12/23andme-receives-50-million-and-drops.html

http://www.thegeneticgenealogist.com/2012/12/26/23andme-and-labcorp-sued-for-patent-infringement/

7. Ethnicity Tests Mature – Minus 1

The good news is that the various ethnicity tests (known as BGA or biogeographical ancestry tests) that provide participants with their percentages of various world populations are improving.  The bad news is that there is currently one bad apple in the card with very misleading percentages – and that is Ancestry.com.

23andMe introduced a new version of their ethnicity product in December, expanding from only 3 geographic categories to several.  The Geno 2.0 test results are just beginning to be returned which include ethnicity predictions and references to several base populations.

Family Tree DNA finally has some competition in this arena where for years they have been the only serious player, although opinions differ widely about which of these three organizations results are the most accurate.  All four are Illumina chip based, using hundreds of thousands of locations, as compared with the previous CODIS type tests which used between 15 and 300 markers and are now outdated.  All companies use different reference populations which, of course, provide somewhat different results to participants.  All companies, except Ancestry, have documented and shared their reference population information.

Outside of these companies, Doug McDonald offers a private analysis and Gedmatch offers a series of BGA comparisons written by third parties.

While this industry continues to grow and mature, I’m thinking about just averaging the autosomal ethnic results and calling it good:)

http://dna-explained.com/2012/07/21/ethnicity-finders/

http://dna-explained.com/2012/10/24/ancestrys-mythical-admixture-percentages/

http://dna-explained.com/2012/12/07/new-worldview-at-23andme/

http://dna-explained.com/2012/09/09/doug-mcdonald-on-biogeograpical-analysis/

http://dna-explained.com/2012/12/11/geno-2-0-results-first-peek/

http://www.yourgeneticgenealogist.com/2012_12_01_archive.html

8. Finding Your Roots PBS Series with Henry Louis Gates

PBS sponsored a wonderful series in the spring of 2012 hosted by Henry Louis “Skip” Gates, the chair of African American Studies at Harvard.  This series followed a lesser known 2010 series.  The 2012 inspirational series reached tens of thousands of people and increased awareness of genetic genealogy as well as sparked an interest in genealogy itself, especially for mixed race and African American people.  I was disappointed that the series did not pursue the Native American results unexpectedly obtained for one participant.  It seemed like a missed opportunity.  Series like this bring DNA testing for genealogy into the mainstream, making it less “strange” and frightening and more desirable for the average person.  These stories were both inspirational and heartwarming.  I hope we can look forward to similar programs in the future.

http://en.wikipedia.org/wiki/Finding_Your_Roots

CeCe Moore covered this series in March and April on her blog.

http://www.yourgeneticgenealogist.com/2012/03/finding-your-roots-with-henry-louis.html

http://www.yourgeneticgenealogist.com/2012/04/finding-your-roots-with-henry-louis.html

http://www.yourgeneticgenealogist.com/2012/04/finding-your-roots-with-henry-louis_09.html

http://www.yourgeneticgenealogist.com/2012/04/finding-your-roots-with-henry-louis_16.html

http://www.yourgeneticgenealogist.com/2012/04/finding-your-roots-with-henry-louis_23.html

http://www.yourgeneticgenealogist.com/2012/04/finding-your-roots-with-henry-louis_30.html

9. Ancestry, GeneTree and Sorenson

GeneTree, a for profit company and Sorenson, a non-profit company were both purchased by Ancestry.com.  This was about the same time as Ancestry introduced their autosomal AncestryDNA product.  Speculation was that the autosomal results at Sorenson might be the foundation for the new autosomal test comparisons, although there has been no subsequent evidence of this.

Ancestry initially gave away several thousand kits in order to build their data base, then sold thousands more for $99 before raising the price to what appears to be a normalized price of $129 plus an annual ancestry subscription.

While GeneTree was never a major player in the DNA testing marketspace, Sorenson Molecular Genealogical Foundation played an important role for many years as a nonprofit research institute.  There was significant distress in the genetic genealogy community related to the DNA contributed to Sorenson for research being absorbed by Ancestry as a “for profit” company.  Ancestry is maintaining the www.smgf.org website, but no additional results will be added.  Sorenson has been entirely shuttered.  Many of the Sorenson/GeneTree employees appear to have moved over to Ancestry.

The initial AncestryDNA autosomal product offering is poor, lacks tools and the ethnicity portion has significant issues. It’s strength is that many people who test are already Ancestry subscribers and have attached their trees.  So you can’t see how you connect genetically to your matches (lack of tools), but you can see the trees, if they are attached and not marked as private, of those with whom you match.  Ancestry provides “hints” relative to matching individuals or surnames.

Eventually, if Ancestry improves its products, provides tools and releases the raw data to consumers, this may be a good thing.  It’s an important event in 2012 because of the massive size of Ancestry, but the product is mediocre at best.  Ancestry seems unwilling to acknowledge issues unless their feet are held to the fire publicly as illustrated with a “lab error” erroneous match for an adoptee caught by the consuming public and ignored by Ancestry until CeCe Moore exposed them in her blog.  Whether Ancestry ultimately helps or hurts the genetic genealogy industry is a story yet to be told.  There is very little positive press in the genetic genealogy community surrounding the Ancestry product, but with their captive audience, they are clearly going to be a player.

http://www.yourgeneticgenealogist.com/2012/05/ancestrycom-buys-genetree-and-launches.html

http://dna-explained.com/2012/07/12/did-you-test-at-genetree/

http://dna-explained.com/2012/08/30/is-history-repeating-itself-at-ancestry/

http://dna-explained.com/2012/07/18/the-trouble-with-ancestry-com-matches/

http://dna-explained.com/2012/08/14/y-dna-family-tree-dna-vs-ancestry/

http://dna-explained.com/2012/08/16/ancestrys-consent-form-for-ancestrydna-autosomal-test/

http://dna-explained.com/2012/09/10/ancestry-autosomal-results-are-back/

http://dna-explained.com/2012/10/15/ancestrys-dna-survey/

http://dna-explained.com/2012/10/23/ancestry-to-release-array-data-in-2013/

http://dna-explained.com/2012/10/24/ancestrys-mythical-admixture-percentages/

http://www.thegeneticgenealogist.com/2012/06/19/problems-with-ancestrydnas-genetic-ethnicity-prediction/

http://www.yourgeneticgenealogist.com/2012/08/ancestrydna-confusing-relationship.html

http://www.yourgeneticgenealogist.com/2012/08/follow-up-on-ancestrydna-and-adoptees.html

http://www.yourgeneticgenealogist.com/2012/09/23andme-says-no-match-for-adoptees.html

10. GedMatch

GedMatch, www.gedmatch.com, created by John Olson and Curtis Rogers, isn’t new in 2012, but it’s maturing into a tool that is becoming the defacto workhorse of the serious autosomal community.  People who test at either 23andMe or Family Tree DNA download their raw results and other match information and then use a variety of tools at GedMatch to look at results in different ways and using different thresholds. GedMatch is currently working to accept the newly arriving Geno 2.0 data files.  Ancestry does not at this time allow their customers access to their raw data files, so there is nothing to upload. The bad news is that not everyone downloads/uploads their information.  Only the most savvy users, and the download/upload is not always a smooth process, often necessitating several attempts, a magic wand and some fairy dust for luck.

GedMatch is a volunteer effort funded by donations on the GedMatch site.  The magnitude of this project came to light when they needed new servers this year because the amount of traffic disabled their internet service provider.  It may be a volunteer effort, but it has mainstream requirements.  Therefore, while occasionally frustrating, it’s easy to understand why it’s light on documentation and one has to poke around a bit to figure things out.  I would actually prefer that they make it a subscription site, clean up the bugs, add the documentation and take it to the next level.  It would also be very nice if they could arrange something with the major players in terms of a seamless data transfer for clients.  All told, it’s an amazing contribution as a volunteer site.  Hats off to Curtis and John for their ongoing contribution to genetic genealogists!!!

www.gedmatch.com

http://www.legalgenealogist.com/blog/2012/08/12/gedmatch-a-dna-geeks-dream-site/

______________________________________________________________

Disclosure

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Transfers

Genealogy Services

Genealogy Research