Ancestry Match Purge Update

Posted on July 30, 2020 by Roberta Estes

I’m covering four things in this article today:

Genetic currency and why it matters
Reasons for Ancestry’s purge
Ancestry’s updated plans
What’s next?

Why is Focusing on Ancestry Critical Right Now?

It’s much easier to save something that exists than to create something new in the software world.

Think of your car. It’s a lot easier for a car company to keep the same model year to year than to create a new model with the inherent design, engineering, and associated costs.

Yes, other DNA vendors could and should improve too, but right now, only Ancestry is taking something valuable away from genealogists. Regardless of what we want other companies, or Ancestry, to develop, providing feedback regarding Ancestry’s impending purge of our 6-8 cM matches is critical now, before the deletion occurs and is irreversible.

Some genealogists either don’t care or don’t specifically want to preserve smaller matches. That’s fine and they can simply ignore their smaller matches. Smaller matches DON’T HURT ANYONE. If you don’t like them, just ignore them.

Why would anyone be vehemently opposed to something that is agreed to be useful and valuable about 50% of the time? It has been widely accepted for years that 7 cM matches are valid about half of the time. Science tells us the same thing.

MMB stats by cM 2

Philip Gammon, a statistician, worked with sets of phased data to produce output indicating the rates of valid and invalid matches, meaning when the child matches someone and so does the parent. His numbers indicate that 6 and 7 cM matches were valid 66% and 58% of the time, respectively.

I worked with parent/child trios whose tests I control to determine the accuracy of matches phased to parents.

Ancestry phased matches accuracy.png

Working with parentally phased data, meaning when both parents have tested, a match matches either the mother or father in addition to the tester, the results indicated that matches between 6 and 6.99 cM were valid 30% of the time. Matches between 7 and 7.99 cm were valid 46% of the time. These percentages are smaller than Philips, but these groups are nonendogamous and Philip’s work included endogamous trios.

Parental phasing is the first step in confirming that a match is valid, regardless of the size. The smaller the size of the match, the more additional information is needed. We’re genealogists, we can do that!

shared cm quick reference

I created this combined quick reference chart from an analysis article I wrote based on the results of multiple resources and various testing companies. Note that we begin to see no matching at 3rd cousins, so we would also see 3rd cousins who match between 6-8 cM and those matches will be removed with the purge.

Clearly, smaller matches aren’t valid all of the time, but they certainly aren’t invalid all of the time either. Like any other record we use, they need to be critically evaluated.

Why would anyone care that other people want to use these tools for research?

If you type the name John Smith into a census search – you’re obviously looking for one specific John Smith. There are thousands. No one is advocating deleting the entire census collection because researchers are going to have to utilize some analytical skills to determine which specific John Smith is the ancestor for which they are searching.

Frankly, it’s no one’s business other than the researcher themselves, BUT, the researcher MUST HAVE THE RECORDS AVAILABLE to them in order to perform the analysis.

That’s the difference. Ancestry is deleting the DNA information between 6 and 8 cM leading to our ancestors and if they don’t reevaluate their decision now, once the data is gone, so is our opportunity to use it – forever.

Ancestry more tools

Don’t burn the house down because it needs to be cleaned.

Ancestry’s White Paper

Ancestry published a new matching white paper describing what they are doing, and why.

Ancestry white paper.png

Here’s the link directly, or you can access it at the top of your DNA Matches page.

Ancestry factor

This excerpt from page 13 is critical in understanding the motivation behind this purge.

Individuals on the initial July 13^th call with Ancestry reported that as many as 2/3^rds of people’s matches will be removed during the purge.

Since that time, my blog commenters and people who have emailed me directly are telling me that they will lose “more than 50%” of their matches. The numbers vary, but one person said it was well over 70% for her.

Unless you’ve previously used one of the download tools that have now been discontinued due to the cease-and-desist orders issued by Ancestry, to the best of my knowledge, you have no way of determining in advance how many of your matches fall in the 6-8 cM category and how many you will lose.

I’ve recorded how many total matches I have, but until the purge occurs, there’s no way to know how many of those I’ll lose. In other words, there’s no way for me to quantify my loss or complaint in advance.

Technology Costs Money

In technology terms, let me explain what this means to Ancestry.

Companies have to pay for data storage costs and processing one way or another.

The first way is by purchasing their own hardware, storage and processing equipment, which means as more people test, and more data needs to be stored and processed (matching), the company needs to spend more money for additional equipment.

If the firm doesn’t use their own hardware and the services are cloud-based, they still pay for storage by the amount of space and processing by the minute.

Your DNA kit was a one time purchase, mean a one-time revenue source for Ancestry, but the processor load of matching and storing match lists goes on forever. The only additional revenue source for your DNA, for Ancestry, if is you opted in for medical research or if you purchased a subscription that you would not have otherwise purchased.

It might also be worth noting here that Ancestry laid off 6% of its workforce, 100 people, in February, following in the footsteps of 23andMe, reported here, and that was before the economic downturn that all companies are experiencing now due to the ramifications of Covid.

I’m not surprised that Ancestry continues to seek cost-cutting measures and I am not criticizing them for doing so. I simply hope they will find methods where the burden isn’t directly born by their DNA customers.

The Definition of Small Segment Keeps Increasing

Initially, AncestryDNA included 5 cM matches. Those disappeared in 2016 when Timber arrived. At that point, Ancestry reported that academic (not parental) phasing plus Timber made matches more reliable, so 6 cM matches were supposed to be more reliable at Ancestry than unphased 6 cM or larger matches elsewhere. No one complained about 6 and 7 cM segment matches at that time or discarded them out-of-hand as unreliable, although people who work in this field have always cautioned testers to accumulate layers of evidence in their search.

Many researchers never get to those lower matches because they have many matches at higher levels. Matches are easy to ignore if you’re not interested.

Currently, matches in the 6 and 7 cM range are now being referred to as “small segments,” stated by some that they should never be used because they might be identical by chance and not identical by descent. The term “small segments” used to be reserved for segments below the matching threshold of the testing vendors which used to be 5 cM at Ancestry. The definition of “small segment” has crept up now to include 6 and 7 cM matches. Will it continue to creep upwards as it becomes advantageous? When will 8, 9, 10 cM matches, go away?

One of the justifications for ignoring or deleting smaller segments is that they are “far back in time,” but Ancestry’s documentation about 6 cM matches shows that 21% of the time, a 6 cM match is some flavor of 2^nd, 3^rd or 4^th cousin. That’s hardly far back in time.

Ancestry 6 cm relationship.png

Unknown, Previously Unidentified Ancestors

The need to identify ancestors who are unknown, meaning not just unknown to you – but truly not identified through prior research by anyone eventually affects all genealogists.

Researchers often encounter this situation when they have females with no surnames or when they are researching ancestors with no records at all.

My closet brick walls begin in the 6^th generation, all females, born in the 1760s and died in the 1800s. Their descendants in my generation would be 5^th cousins to me. That’s where my search for truly unknown ancestors begins.

Other people experience brick walls much closer to them in time.

The Good News – People Are Looking

There’s actually a silver lining to Ancestry’s announced purge – people are looking and evaluating these smaller matches now that the matches are in jeopardy of being removed.

Maybe Ancestry’s threat to remove these matches was a genius marketing ploy to encourage us to use them (wink, wink.) Let’s hope so and Ancestry retains those matches and continues to provide their customers with matches at this level.

Numerous people have stated that they are finding patterns in multiple matches, especially if they manage multiple kits for various family members. Because of the 20 cM shared match threshold limit at Ancestry, testers may not see other family members on their shared match list, but looking at their other family members’ actual match list – those smaller matches are sometimes there. Researchers are finding matches between 6-8 cM that match multiple family members. Finding those matches is the beginning of analysis.

Let me explain that a different way. I’m looking at my shared matches with person A. I see no shared matches below 20 cM because that’s Ancestry’s shared match display limit.

However, person A’s sibling, person B, also matches me below 20 cM, but I can’t see that shared match with person A because my shared match with person B is below 20 cM. However, checking my match list for person B’s name shows that they are a match to me. However, there is no way to know that I match person B in common with person A.

Then, checking another family member, like an aunt, for example, I see that person A and person B both match her as well, probably also on segments below 20 cM so she can’t see them on her shared match list either, nor can I see either of those matches, person A or person B on my shared match list with my aunt.

Reaching out to matches below 20 cM and asking if they have other family members you can check, by name, to see if they are on your match list is important. Many people don’t realize shared matches below 20 cM aren’t shown at Ancestry.

I know that, but sometimes I tend to forget that when viewing shared matches and have to remind myself.

Are You a Researcher Who Could Benefit from Smaller Segment Matches?

What types of researchers are finding interesting matches that they are pursuing and finding promising leads or beneficial connections? Truthfully, I hadn’t thought of several of these. Here’s what people have reported recently.

People with Irish ancestry before the 1920 records fire.
African Americans hoping to identify their ancestors and connect with descendants
People tracking matches to locations, such as specific villages in Europe.
People tracking US colonial records where their brick walls occur.
People seeking unknown ancestors in locations where records have burned.
Native American researchers seeking connections before the adoption of European surnames, often in the late 1800s.
Acadian matches from before the 1755 “Grand Derangement” when the Acadians were forcibly evicted from Nova Scotia
New Mexico and Southwest US connections to early Spanish families
Hawaiian researchers’ connections to Native Hawaiians

The keyword here is “pursuing.”

No single match should be taken as proof of anything, certainly not at this level. Cumulative evidence is another matter.

DNA evidence is just like every other type of evidence. We research and build upon what we find. Sometimes we discard what we’ve found when we find it to be invalid. We learn how to evaluate the evidence we discover. DNA isn’t any different. But we must have that evidence before we can evaluate it.

I wrote about that in Ancestors: What Constitutes Proof?

Genealogy Goals

What you’re trying to accomplish with DNA testing will determine whether or not smaller segments are important to you. One size does not fit all – pardon the pun. Your goals may also change over time. Mine certainly have as I moved from confirming existing line to attempting to break down brick walls that no one has the answer to today.

Researchers have different goals for DNA testing in conjunction with genealogy. Working with smaller segments isn’t for everyone.

Many people who only want to confirm known ancestors and have no idea why or how smaller segment matches might be valuable to themselves, now or eventually, or to others. Adoptees looking for their biological parents don’t need or want those small segment matches In general, smaller matches, unless they have a tree posted with a shared ancestor, require more work and are typically used by more experienced genealogists.

Let’s take a look at the various categories of research, which might explain why someone you’re talking to might have a different opinion about matches between 6-8 cM, or might be ambivalent.

Research Type or Interest	Applicable DNA Research/Comments
Ethnicity and populations	Ethnicity and population reports are available at all 4 major vendors, plus sometimes additional tools. People who test for ethnicity may not be interested in traditional genealogy or DNA matching.
Adoption or unknown parent searches or other close relative searches (grandparents, etc.)	People searching for close family members focus on close matches beginning with their highest matches, then tree matching, not generally more distant matches. I wrote about that here.
Confirming known ancestors already in your tree	Confirmation occurs by matching to (and triangulating with) multiple other testers who share common identified ancestors. Tools like Theories of Family Relativity (MyHeritage) and ThruLines (Ancestry, but no triangulation) automate this process as does Phased Family Matching (FTDNA), in addition to some third party tools.
Discovering previously unknown ancestors that someone else has already researched	DNA matching and advanced tools such as ThruLines (Ancestry) and Theories of Family Relativity (MyHeritage), but these tools require that someone already has identified these ancestors and placed them in their tree.
Discovering unidentified and previously unknown ancestors, meaning those where records don’t exist, are not previously researched/documented and are not already in someone’s tree.	Every generation back in time increases the number of brick walls that genealogists hit. A researcher born in 1980 is likely to be 4^th cousins to someone born from a common ancestor in 1850. Some 3^rd and 4^th cousins won’t DNA match at all, some will match on larger segments and some will only match on smaller segments (6-8 cM). The number of people who match and the segment size (generally) decreases in every generation as the DNA is divided.

If you’re thinking to yourself that you have ancestors that are entirely brick-walled – then you’re a candidate to utilize matches between 6-8 cM. Remember, roughly half of those matches are valid, and yes, there are evidentiary tools and methods of evaluation.

If you’re not back to brick-walled ancestors in your research yet today, eventually you will progress beyond available paper records and will find yourself in need of DNA. If the only DNA that you carry from those ancestors are segments between 6 and 8 cM, and they’re gone – you’re entirely out of luck. Just like when the Irish Records office burned in Dublin in 1922.

Ancestry Irish records office fire.jpg

Doesn’t that picture just hurt your heart, understanding the magnitude of the history that is burning?

DNA is the Currency of Our Ancestors

I’ve been searching for how to describe the situation people with brick walls, no surnames, and no written history face.

Think of your ancestors’ DNA as genetic currency.

You have large bills that represent what you received from your parents. As you move further back in time, those bills become 20s, then 10s and 5s. Finally $1 bills. Then, change.

The problem is that some people know which bill, meaning what ancestor that change came from, because they can track it directly backward in time, bill to bill, and ancestor to ancestor. Their change is all stacked in nice neat ancestor piles because they have the records to connect them to other descendants that know that ancestor is theirs too.

Ancestry coins

Other people who don’t have the benefit of that knowledge just have a bag of change all mixed together. They don’t’ know where those coins came from, and the coins, or smaller DNA segments, themselves, MUST point the way to the identification of their ancestors.

Ancestry coins pile.jpg

While their pile of change is messy, there are tools for researchers to sort through the coins and organize – identifying which coins came from which ancestors. Tools like shared matches, clustering, and more.

If you take their coins away, researches who have hit brick walls, which we all eventually do, have no genetic currency to work with.

Franklin Smith, an African American genealogist at the Clayton Library in Houston shares his experiences on Dana Leeds’ blog, here.

Ancestry Delayed the Purge for a Month

Ancestry’s decision to purge matches of 6-8 cM is critically important for brick-walled genealogists because, in part, of the sheer magnitude of their database.

Let’s say, for example, that we need to find a minimum of 10 people descending from this same couple through different children before we’re comfortable that this connection is valid.

If we can find 10 people at Ancestry, in a smaller database, we may only be able to find a few – certainly not nearly 10. If that database doesn’t provide matches to 6 cM either or has an arbitrary match cutoff, we may not be able to see those matches elsewhere either. Furthermore, not everyone tests elsewhere or transfers their DNA file. That’s exactly why it’s so critical to keep the Ancestry matches.

The combination of the 6-8 cM segment matches, more likely to be accurate because of phasing and Timber, and the large number of testers at Ancestry provides us with an increased opportunity to be successful.

Ancestry has not communicated with me directly, but I was provided with this posting from the Ancestry Facebook page wherein the “author” with the Ancestry logo by their name states that they are delaying the purge for a month, until the beginning of September. That’s good news, but clearly not enough news.

Ancestry posting

Please note that Ancestry:

Has delayed the purge until “late August”
Has clarified that starred matches (in the groups) are saved
Is beginning, soon, to show decimals so you don’t have to save all 8 cM matches in order to be sure you save all 7 cM matches due to Ancestry’s rounding up.

Earlier today, the “Learn More” link at the top of the DNA matches page has been updated with the following information, which confirms the Facebook posting.

Ancestry FAQ

I am hopeful that Ancestry is still evaluating its overall decision and instead of a mass purge, will provide more effective tools for their customers to utilize.

I can think of several, but the first approach would be that if a match does not phase with parents, assuming both have tested, it should be removed, regardless of the size.

Providing genealogists with analysis tools, similar to the now-banned third-party tools, would be a wonderful addition. Just un-banning those tools is really all we need.

Allow genealogists to flag some matches for deletion which we have determined are not valid would be beneficial. Similar to “ignoring” incorrect records hints.

Provide Feedback to Ancestry

Ancestry provided roughly a month’s grace period to allow users frantically struggling to save their relevant 6-8 cM matches some relief. I provided preservation strategies and instructions for how to prevent matches from being deleted, here.

This temporary reprieve doesn’t address 6-8 cM matches that exist today and aren’t saved, nor future 6-8 cM matches.

Please continue to provide polite feedback to Ancestry.

Feedback channels include the following:

Email Ancestry support at ancestrysupport@ancestry.com.
You can initiate an online “chat” here.
Call ancestry support at 1-899-958-9124 although people have been reporting obtaining offshore call-centers and problems understanding representatives. You also may need to ask for a supervisor.
Ancestry corporate headquarters phone number on the website is listed as 801-705-7000.
You can’t post directly on Ancestry’s Facebook page, but you can comment on posts and you can message them.
Ancestry’s Twitter feed is here.

Someone pointed out that the chromosome browser petitions initiated a few years ago went exactly no place, but like I mentioned previously, it’s a lot easier to keep something that exists than it is to build something new. I’m still hopeful that our voices will make a difference this time!

If you’d like to sign petitions, at least three have been created:

What’s Next

I’ve had requests to review what methods and tools available at each testing vendor to assist genealogists who need to search for unknown, undiscovered, previously unresearched ancestors. That’s a great idea!

After Ancestry completes whatever they decide to do and things settle down a bit, I will write a series of articles about how to utilize the various tools offered by each vendor that can be utilized by brick-walled researchers – along with suggestions for improvements every vendor can make to improve our chances of success.

Eventually, all genealogists will move beyond ethnicity or confirming documented ancestors into the realm of the unknown where we need every piece of genetic currency that we can find – along with advanced analysis tools to help us sort the wheat from the chaff and assign names of ancestors to those DNA segments.

The best thing Ancestry can do for us, right now, is to NOT delete those matches. The best thing you can do is to share your opinion with Ancestry.

_____________________________________________________________

Disclosure

I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Transfers

FamilyTreeDNA – Y, mitochondrial and autosomal DNA testing
MyHeritage DNA – ancestry autosomal DNA only, not health
MyHeritage DNA plus Health
MyHeritage FREE DNA file upload – transfer your results from other vendors free
AncestryDNA – autosomal DNA only
23andMe Ancestry – autosomal DNA only, no Health
23andMe Ancestry Plus Health
LivingDNA

Genealogy Products and Services

MyHeritage FREE Tree Builder – genealogy software for your computer
MyHeritage Subscription with Free Trial
Legacy Family Tree Webinars – genealogy and DNA classes, subscription-based, some free
Legacy Family Tree Software – genealogy software for your computer
Charting Companion – Charts and Reports to use with your genealogy software or FamilySearch

Genealogy Research

Legacy Tree Genealogists – professional genealogy research

113 thoughts on “Ancestry Match Purge Update”

geebee1015 on August 21, 2020 at 9:00 pm said:

So I’ve been going through some of the small (under 8.0 cM) matches I saved. I only kept ones with “common ancestors”.

What I’m seeing is several matches that are reported to share only a single segment yet have a “longest” segment of 9 cM or more. Now, obviously they’re rounding the cM for the longest segment, but if there’s just one matching segment and it rounds to 9 cM it must have been 8.5 cM or more to do so.

So what gives?

Loading...

Reply ↓
- Roberta Estes on August 22, 2020 at 3:18 pm said:
  
  Ancestry has a mess right now.
  
  Loading...
  
  Reply ↓
Gary Bookhammer on August 22, 2020 at 1:40 pm said:

What especially bothers me now that Ancestry is reporting “longest segment” is how often the number is greater than the reported total sharing. It’s been suggested that this is probably because Ancestry is using the “pre-Timber adjustment” amount in reporting longest segment.

In some cases, I’ve even found instances in which the “longest segment” was larger than 20 cM! In most cases, the only difference was that the “longest segment” is reported as the nearest integer, rather than in tenths.

One match I found is to a known 3rd cousin. The shared amount reported by Ancestry is 7.5 cM in one segment, so before their decision to stop rounding to the nearest whole number it would have been displayed as 8 cM. Of course, that could have led me to believe it was “safe” from deletion.

The “longest segment” as reported by Ancestry is 13 cM. This is close to twice as much as the “adjusted” total. So if Ancestry wanted to be fair — or at least, fair-ER — they would refrain from deleting those matches which would be above 8.0 cM if not adjusted by Timber.

But I think we all know Ancestry’s motivation isn’t fairness to customers.

Loading...

Reply ↓
Gary Bookhammer on August 22, 2020 at 3:52 pm said:

Right now this “longest segment” business is being very informative. My daughter has a 4th cousin once removed that Ancestry predicts as her “5th to 8th cousin” based on sharing of 15 cM across 2 DNA segments.

The thing is, Ancestry is now showing “longest segment” as 58 cM, and the actual relationship is 4th cousin once removed. Clearly, if a single segment is already larger than Ancestry’s combined total cM for this match, it has been downgraded by *something* — likely Timber.

My own sharing with this person — 4th cousin to me — is given as 30 cM. But my “longest segment” is reported as 64 cM. That’s over double what Ancestry reports as the total sharing, yet still believable for 4th cousins.

As much as Ancestry needs to rethink deletion of all these small (under 8.0 cM) — which doesn’t apply to the matches I’ve just mentioned — they also need to STOP using Timber. Or at the very least, they should report both the pre- and post-Timber amounts.

I’ve seen now dozens of my matches, my wife’s matches, and my daughter’s matches that show a total *less than* 8.0 cM and yet have a longest segment which is *greater than* 8.0 cM. Some of them have a longest segment in excess of 20 cM, yet presumably Ancestry will be deleting them on the basis of a potentially phony “post-Timber” adjustment.

Loading...

Reply ↓
Roberta Estes on August 23, 2020 at 9:04 pm said:

Please note that a new script by Earl Hauks is found here (open the comments) and runs better than the original script. Wicked fast too. The only modifiction you need to make is to change the group name at the bottom in the last line between the quote marks or, conversely, create a group titled Distant Relatives. https://www.facebook.com/groups/407494112747727/permalink/1756378334525958/

Loading...

Reply ↓
Gary Bookhammer on August 29, 2020 at 6:43 pm said:

So I now know — because I took the time to count them — that I have 551 “small matches” (total sharing between 6.0 and 7.9 cM) if I limit myself to only those with “common ancestors”. (If I tried to count *all* my small matches, I’d run into the thousands.)

Of these, 280 — a little more than half — have a “longest segment” of at least 9 cM. What that means is that over half of these matches would be “safe” if not for Timber’s downward adjustment of “cM”.

And what’s the basis for this adjustment? Ancestry’s statistical determination that the match is “too matchy”. But again, the Timber algorithm cannot know that in any give case that this is actually true. It only knows that there *may be* “excess matching” in the region.

As a matter of fact, Ancestry actually knows that Timber is sometimes wrong. This is why Ancestry suppresses Timber in certain cases, such as when initial matching is at least 90 cM. The truth is, real matches often do overlap with “pile up” regions — and matches within these regions may be as “real” as anywhere else in the genome.

In fact, I’ve looked closely at many of my matches for which I know the relationship, and generally the “pre-Timber” sharing seems as plausible — or moreso — than the “post-Timber” amount. In other cases, it doesn’t seem to make a great deal of difference. (In which case, what makes Timber “necessary”?)

I might also note that 30 of these “small matches” show a “longest segment” of over 20 cM, including one which has an adjusted total cM of only 6.0. That’s only half of 1%, but keep in mind that this means the “unadjusted total” would qualify the match to be included among my shared matches.

Also, some of the matches shown with a “longest segment” of 8 cM might also have been “safe”, since some of them would likely be at least 8.0, since Ancestry rounds the “longest segment” number. In addition, if there is more than one segment we don’t really know what the unadjusted total would be.

So it’s possible that many more of my “small” matches would actually be above 8.0 cM if not for Timber.

Loading...

Reply ↓
- Doug Bank on August 31, 2020 at 2:08 am said:
  
  Is there a blog post or article somewhere that explains Timber and “pile up regions” and other terms like that??
  
  Loading...
  
  Reply ↓
  - Roberta Estes on September 1, 2020 at 3:35 pm said:
    
    If you use the search box on my blog, you will find some. Ancestry also published a paper that included information about Timber.
    
    Loading...
    
    Reply ↓
- lindarhorton on September 1, 2020 at 1:57 pm said:
  
  On August 15, I had 94,404 matches. Today I had only 53,563 matches, a loss of 40,841 matches. I did not attempt to use the scripts but instead sought to salvage all matches with common ancestors (ThruLines). I also followed Jim Bartlett’s advice and used AncestryDNA’s search function to identify matches with ancestral surnames and locations.
  
  Although I am remorseful that I lost so many matches in the 6-7 cM range, I am also aware that few of the lost matches were in possession of information that could have helped me break through brick walls in my tree .All my ancestral lines were in British colonial America before 1760, so I am deeply affected by the poor records on both sides of the Atlantic during the 18th century and earlier. Although the small matches are important to someone with my colonial-era ancestry, the potential for a match to share various common ancestors means that I cannot be certain which ancestral pair contributed the shared DNA segment.
  
  I am going to take a break from AncestryDNA for awhile and work on other genealogy priorities!
  
  Loading...
  
  Reply ↓
Jeffrey Crowe on August 31, 2020 at 2:47 pm said:

I hoped Ancestry would have come up with a more nuanced approach to the purge. When I first started to archive (star my 8-6 matches without a group ) I thought a month ago I might have 10-15% of my total matches falling into that subset of total matches. Now having worked (clicking madly– up to about 30K newly archived matches) I realize I will probably not complete my goal of saving my current matches. I keep, in a rough in ready sort of way, upping my total of matches that fall in those last 2 CM s. Wow what a big number. I am estimating about 50% of my total current matches might fall into those two CMs(7.9-6.0). I can always come up with a reason to try and save them all… ” a no tree match might add a tree in the future.. to “a 2 person tree might add more people in the future”. Alas, time to triage.
When I said a “more nuanced approach” above I meant something like maybe a current or future low CM match might be allowed in as a match if they fulfill certain criteria. For example: say you have a current match. A good solid Thrulines match at 12CM. You have researched and yes they are a 5th cousin also via the paper trail. So you get a second match say at 7.3. In the post Purge world they would not qualify as a match on the revised list. The second match is a child of the first match. The DNA shared between them is very high. So one would think that match #2 could be “grandfathered- grandmothered” in as a match to you. You match A and A matches B strongly, therefore B is also a match despite sharing a substandard amount of DNA.
I tend to think that nothing is irreversible. So initially, someone loses 30k matches off their list. Some could come back later if as I suggested a better relationship formula is devised. Ancestry is still going to have all those millions of DNA testers in their system so plug in a new formula and maybe some could come back into the fold.
Well, I’m not holding my breath about that one.
So I recently acquired a new Thrulines match at 6CM. I looked into it and yes a 5C1R. Now in my tree. Doing this archiving I discovered I also have a second match a son of the mother. This man doesn’t show up as a Thrulines hint, despite the two of them having the same large tree. I added him into my tree with both parents each with a birth date. Still no Thulines. Also, the son shares more CM with me (7.4 1 seg. “longest segment 9CM) than the calculated number Ancestry has given me for the mother (6CM 1 seg. longest segment 8). His official CM number is still falls within that “longest segment of the mother”. Perhaps recombination? Weird.
Back to the salt mines…………

Loading...

Reply ↓
Jeffrey Crowe on September 1, 2020 at 3:03 am said:

ViVa La Purge! (or not). Woke up after a little snooze………. Occurred tonight. So with extreme archiving approx. 31.8K saved. I lost 7664 (.97%). My wife with minimal archiving lost 33,720 (57%) . One of her 5th cousins with minimal archiving of matches lost 31,284 (52.47%). My loss would have been close to 50% without any of the archiving of matches.

Loading...

Reply ↓
- Roberta Estes on September 1, 2020 at 5:48 am said:
  
  Mine would have been about the same. So grateful for those scripts.
  
  Loading...
  
  Reply ↓
Gary Bookhammer on September 1, 2020 at 7:59 am said:

I know we’re talking about Ancestry’s decision to drop matches in the 6.0-7.9 cM range, but I still think we need to focus on just how much Timber is compounding our problems.

As I’ve noted in other comments, a lot of these matches that Ancestry wants to get rid of may not actually be as small as Ancestry claims they are. Many of my small matches show a “longest segment” which is not only more than 8.0 cM, in some cases it’s even more than 20 cM.

But I have an even more egregious situation involving a couple of my matches in the low 20s. Of course this means they’d be “safe” from Ancestry’s axe, but I mention them because people may not understand just how much of a difference Timber is sometimes making.

I’ll call these two matches DC and NM. According to Ancestry, DC matches me for a total of 21 cM across five segments. That’s a lot of segments for just 21 cM — they wouldn’t average much more than 4 cM each. However, Ancestry reports the longest segment as 35 cM.

In the case of NM, Ancestry reports sharing of 20 cM in two segments, with a longest segment of 39 cM. Obviously, Timber has “reduced” this segment, supposedly for “excess matching”. But is Timber correct?

Well, it turns out that both DC and NM also tested at 23andMe. 23andMe says that DC matches me for 90 cM in ten segments, and NM matches me for 47 cM in three segments. My longest shared segment with DC is 35.10 cM — which is nearly exactly the same as Ancestry’s reported length for one of the segments. My longest shared segment with NM is quite a bit less than Ancestry’s reporting, only 20.69 cM.

*But*, there’s only a very small gap on the same chromosome before there is *another* shared segment with NM, of 17.78 cM. If you add these two segments together, you get nearly the same number as Ancestry’s longest segment. Obviously, Ancestry is seeing these two segments as just one — which may very well be correct.

Here’s the deal, though. I can compare DC and NM directly to each other in the chromosome browser. It turns out that their sharing with each other is 137 cM in ten segments. This means that under Timber’s rules, sharing between DC and NM should not be “discounted” between the two of them.

You’d also think it shouldn’t be discounted between DC and me, since Timber isn’t supposed to “adjust” amounts of at least 90 cM. Well, the problem is that my sharing with DC at 23andMe includes and 18.97 cM segment on the X chromosome — and Ancestry does not include any sharing on the X chromosome in total cM. So Timber took what Ancestry probably saw as only a 71 cM match and reduced it to a 21 cM match.

By the way, my longest match with both DC and NM is in practically the same location — and the two of them share a 35.24 cM segment with each other in that location.

I also compared both DC and NM to my daughter. My daughter’s sharing with both of these matches is nearly identical to mine (it might be off by tenths or hundredths of a cM). It certainly looks to me as if these are real matches, but I would never have known Ancestry discounted the match lengths apart from 23andMe’s chromosome browser and Ancestry’s inclusion of “longest segment”.

And I’d be willing to bet that before long Ancestry gets figured out that if they just report “longest segment” using POST-Timber numbers that they may be able to continue to keep us in the dark.

Loading...

Reply ↓
ALAN MOLL on November 25, 2020 at 1:10 am said:

Thank you for this information. I just offloaded (screen scraped) my 6-8 cM Ancestry matches. I wish Ancestry had a download/export feature for these matches.

Alan Moll
DNA Enthusiast — Constantly Learning

Loading...

Reply ↓
ALAN MOLL on November 25, 2020 at 1:46 am said:

Ooops! I looked at what I pulled off of Ancestry. 8 cM are still there, but 6-7 are gone 🙁

Alan

Loading...

Reply ↓
Pingback: Ancestry Only Shows Shared Matches of 20 cM and Greater – What That Means & Why It Matters | DNAeXplained – Genetic Genealogy