All About AI – What It Is, What It Isn’t, and Why It Matters

Posted on June 25, 2026 by Roberta Estes

This is the second article in the AI series. The first, Your Wonderful AI Assistant – Sometimes Wrong, Never Unsure, Always Convincing, explains why I’m writing this series and what to expect. I suggest that you read these articles in publication order, as they build on each other.

AI is neither inherently good nor bad. The outcome depends on:

How it is used
By whom
Capabilities of the (ever-changing) tools themselves
The level of understanding of both the “requester” and the “consumer”
Safeguards applied or neglected

About AI

Let me start by saying that I don’t love AI, and I don’t hate it. I’m neither an evangelist nor a doomsayer. I’m a realist. AI is a powerful tool, capable of remarkable things and spectacular failures. Understanding the difference and interacting appropriately are what separate success from failure.

AI is simply a tool, and like all tools, it can be used for good or evil. AI has the potential to, and does, in some cases, make our lives easier. However, the bad guys and miscreants saw that potential early and have perfected exploiting it.

AI is all around us, whether you realize it or not, so don’t think you can just avoid it, because you can’t. AI exists in many forms and is here to stay. We need to educate ourselves so we can reap some of the benefits and avoid the pitfalls.

Education and increased vigilance are the only ways to protect yourself, and I mean vigilance incorporated into the very fiber of your being. No more, “that looks interesting” and clicking without thinking. It’s so easy to do.

When I talk about AI safety, I’m referring to two types of safety.

Using AI tools for reliable results, and how to determine when you’re receiving or consuming something questionable. AI failures occur often and are both irritating and misleading, but not always obvious.
Literally protecting yourself from danger. This includes recognizing when AI is being used without your knowledge and how to protect yourself in the new threat landscape. I am not exaggerating.

Unfortunately, AI safety is a sliding scale, progressing from one end of the spectrum to the other. There’s not always a clear delineation between correct and incorrect, safe and unsafe, or between different types of AI. As I am wont to say, “It depends.”

Learning about AI, both in general and in specific contexts, is critical. Not yesterday’s AI – but AI right now, because both the AI tools and AI’s capabilities are changing at lightning speed.

We all need to up our game and retrain ourselves to always stop and think first.

AI and You

There are essentially three ways people encounter or interact with AI.

You’re actively using AI as a tool, such as ChatGPT, Claude, Gemini, or others. This is generally safe from an actual danger or “threat” perspective, particularly because you are in the driver’s seat. However, there are aspects you need to be aware of – especially if you’re a novice. I’ll explain methodologies to use AI to (hopefully) increase your productivity and save you from following AI into the underbrush of falsehoods, inaccuracies, and misplaced confidence. In other words, so you don’t have to say, “Wow, was I ever an idiot,” too often.
You’re unknowingly interacting with AI. Sometimes this is fine, but it can open the door to inadvertent reliance on incorrect information and therefore various forms of harm. Sometimes, harm rises to the level of actual danger. Understanding when you’re interacting with AI, understanding its limitations, and recognizing danger signs are important aspects of staying safe.
The AI threat landscape. AI can be dangerous and used against you. I mean screaming-red-neon-flashing-sign hair-on-fire dangerous, and I’m going to explain this new threat landscape and how to improve your chances of being safe, primarily in the final article of this series.

I Use AI, But There Are Limits

I hold a graduate degree in Computer Science and have years of experience in the technology industry where security is both essential and critical. That background, while preparing me generally, cannot prepare one for the situations and well-hidden threats we now encounter every day. Being overconfident and overreliant on prior experience is foolhardy and a sure way to get burned.

The one thing that’s constant in the computer industry is change. The underlying fundamentals remain the same, but everything else changes – and AI is morphing rapidly.

I’ve been using AI since the beginning in a very restricted, measured way. I use AI regularly, tactically, and cautiously, with huge guardrails. I started out by taking classes from Mark Thompson and Steve Little, AI experts in the genealogy space, to learn how to use AI productively. That was a couple of years ago, and the entire landscape has changed since then. I make it a priority to stay current.

In the next article about using AI safely, I’ll share recommendations for training and education from Mark and Steve.

AI tools are trying to emerge from their terrible toddler stage and morph into early teens, but they relapse a lot! Sometimes AI is very helpful, sometimes wrong, and often frustrating – interspersed with amazing victories where AI helps us immensely.

Unfortunately, often it’s almost impossible to tell which is which.

Inspired by a posting in the Facebook group, Genealogy and Artificial Intelligence. Image is AI generated and appropriately labeled as such.

Here’s the caveat – I know I’m using AI. I’m not accidentally interfacing with a Chatbot, thinking it’s a human. I’m not reading something someone else posted and believing I’m reading about an experience that’s true – when it’s AI-created fiction. The question, of course, at that point, is WHY someone created it and posted it in a way that conceals its true origins.

My AI usage is intentional. I know how to be vigilant, generally what AI can and can’t do, and that I absolutely positively MUST fact-check everything. Often, I inadvertently push the limits of AI, thinking it can perform more than it can accurately, which is another reason everything must be checked. As genealogists, verifying sources should be second nature.

If you’re going to use AI, it’s essential that you do the same thing.

So, what, exactly, is AI?

What is Artificial Intelligence?

This is really a difficult question to answer, because AI has been more of a slow evolution, followed by a rapid acceleration of technology – not a specific “thing.” That acceleration occurred when standalone AI tools like ChatGPT, which we know are AI because they are specifically called that, were introduced and made available to the consuming public.

We’ve been using computers for decades now, assisting us on platforms from mainframes to PCs to tablets. Today, our phones are more powerful and useful than early mainframes.

AI is the latest in the cadre of applications, a type of tool that can either stand alone or be embedded in other software tools for specific tasks. Think Chatbots for business websites.

While AI is beginning to be “everywhere,” it’s not a universal scapegoat.

Two years in, AI is being blamed for everything. While AI does make a lot of mistakes, many issues aren’t a result of AI, and it’s not fair to presume they are. Let me give you two examples of what is and is not AI.

Not AI – Someone tried to enter text, meaning letters, in a field meant exclusively for numbers, like a month field that’s supposed to hold a number and not the month name. The person was angry because “AI was wrong” and prevented the erroneous entry. First, it wasn’t wrong, and second, it wasn’t AI.

One of the earliest computer uses was to parse date fields and ensure that the “right thing” was being entered in the correct place. In this case, a numerical month, not the month name. That’s not AI. That’s just plain old-fashioned programming error-checking that’s been a part of software for decades. The program was performing exactly as it was intended.
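For readers who like to see what that kind of old-fashioned error-checking looks like, here is a minimal sketch in Python. The function name validate_month and the sample entries are my own inventions for illustration, not code from any real product. Notice that there’s nothing “intelligent” here, just fixed rules that behave the same way every time.

```python
def validate_month(raw: str) -> int:
    """Return the month as an integer from 1 to 12, or raise ValueError."""
    text = raw.strip()
    # Accept plain digits only, so "June" or "6a" is rejected outright.
    if not (text.isascii() and text.isdigit()):
        raise ValueError(f"Month must be a number from 1 to 12, got {raw!r}")
    month = int(text)
    # A number is not enough; it also has to be a real month.
    if not 1 <= month <= 12:
        raise ValueError(f"Month must be a number from 1 to 12, got {raw!r}")
    return month


# Hypothetical entries a user might type into the month field.
for entry in ["6", "06", "June", "13"]:
    try:
        print(entry, "->", validate_month(entry))
    except ValueError as err:
        print(entry, "-> rejected:", err)
```

The entry “June” is rejected because the rule says so, not because anything “decided” it was wrong.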

AI – I submitted a spreadsheet to ChatGPT and instructed it to move all of the data in cells in column A that are entirely numeric to the same row in Column B, and to leave everything that contains any alphabetic characters where it is in column A. That’s AI, both because I’m using a known AI tool, and it’s processing my instructions to produce output that did not exist before.

The above image is what I wanted. I completed this by hand to show you what I had in mind. Working by hand is fine with 8 rows of data, but it wouldn’t be fine with 1000 rows, or more. That’s when you need a tool.

What could go wrong? Plenty.

Let’s say that I didn’t provide specific instructions and a cell contained mixed alpha and numeric, like Jane2. Or the tool could just plain mess up for some other unknown reason – such as the file being too long or a misinterpreted instruction. That’s why you have to verify everything.
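To show how specific the instructions really need to be, here is a rough sketch of the same column task written as ordinary Python instead of a conversational request. The sample values and the function names split_numeric and verify are made up for illustration. I’m also assuming that “entirely numeric” means digits only, so a value like 3.5 or -7 would stay in column A. That is exactly the kind of decision an AI tool makes silently on your behalf.

```python
def split_numeric(column_a):
    """Move entirely numeric cells to column B on the same row; leave the rest in A."""
    new_a, new_b = [], []
    for cell in column_a:
        text = str(cell).strip()
        if text.isascii() and text.isdigit():
            # Digits only: the value moves to column B and A becomes blank.
            new_a.append("")
            new_b.append(text)
        else:
            # Anything with letters, like "Jane2", stays where it is.
            new_a.append(text)
            new_b.append("")
    return new_a, new_b


def verify(original, new_a, new_b):
    """Check that every row survived: same row count, value in A or B but not both."""
    if not (len(original) == len(new_a) == len(new_b)):
        return False
    for old, a, b in zip(original, new_a, new_b):
        if a + b != str(old).strip() or (a and b):
            return False
    return True


# Made-up sample data, including the mixed "Jane2" case.
column_a = ["Jane", "1850", "Jane2", "42", "Smith", "1901"]
new_a, new_b = split_numeric(column_a)
print(new_a)
print(new_b)
print("Verified:", verify(column_a, new_a, new_b))
```

The verify step is the important part. Whether a program or an AI tool did the work, a simple row-by-row comparison against the original is one way to confirm that nothing was lost, duplicated, or changed.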

With AI, it’s always some variant of the wild west frontier.

Next, I submitted my Before and After spreadsheet, above, and instructed ChatGPT to “Please put this in a chart and make it pretty.”

This is exactly what I received.

I didn’t receive what I wanted, because I didn’t tell the AI tool specifically what I wanted (spacing, color, font, size), and what I didn’t want. This isn’t a problem with the AI tool; it’s a problem with the instructions provided by the “driver.” AI is not a mind-reader, at least not yet.

Hint: When I don’t receive what I wanted, I tell ChatGPT what I wanted and ask it why I didn’t receive that, and what instructions I could provide differently. In this case, I learned that it can’t “discern colored text” (red) and only sometimes can “see” bolding.

This was a very simple comparison of AI versus non-AI. Of course there are endless variations, but in general, AI does something that produces something new or different or in another format – based on conversational instructions.

Examples of what AI can do well:

Take notes and summarize online meetings
Organize information into outline format
Suggest structure
Proofread and sometimes provide editing suggestions
Suggest places to look for additional information
Translate, transcribe and summarize both typewritten and handwritten documents, in multiple languages

Every one of these comes with a caveat. AI can always be wrong. Like any helper or intern, it’s up to us, as the responsible party, to be, well, responsible by monitoring and verifying everything.

Being wrong in places does not mean the tool isn’t useful. AI can transcribe an entire document in seconds, but I need to proofread it against the original. That’s a significant time savings for me. AI can then assist with the logic of how people are related to each other. That doesn’t mean it’s accurate, but it’s a place to start.

We have to learn how to communicate with our intern in a way it can understand to (hopefully) receive the output we want, and then we have to confirm that the output really is what we wanted.

The more difficult and complex the task, the more difficult the verification.

GIGO

The overarching theme for all computer data is GIGO – garbage in, garbage out. I know everyone can think of hundreds of examples that have absolutely nothing to do with AI. It’s the same now, but on steroids because we add the layers of:

Our instructions to AI, which may or may not be as thorough as we thought
AI interpreting what it thought we said, according to its internal rules and limitations that we don’t understand
AI manipulating data and producing output on our behalf

Additionally, when we ask AI to gather information about something, it can only gather what it can see. For example, some AI tools cannot reliably open weblinks, while others can. Some, like Google, have internal routines to rank sites that are more reliable and accurate, and other tools do not.

Asking your AI tool for its sources so you can evaluate the GIGO factor is essential too.

Drinking From the Firehose

You might think AI is completely new, but it really isn’t. What’s new is the label of AI and consumer-based products where you get to be the driver.

Think of AI as the big umbrella.

In the past decade or so, artificial intelligence models have slowly been developed, often for specific use cases. Machine learning models that are self-teaching are good examples. Genetic imputation to equalize autosomal DNA files produced by different vendors before matching is a specific use case.

Traditional programming is very specific and instructs, “If X, then Y.” Imputation, within a limited range of options, says, “Based on X, I think Y is the most likely next character.” Machine learning learns by example. AI is the next generation where answers to questions are not hard-coded or self-learned in the same way.
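Here is a toy sketch in Python of the difference between a hard-coded rule and a “most likely next character” guess. This is not how imputation or any vendor’s machine learning actually works; it’s simple counting over a handful of example words that I made up, and the function names are hypothetical. It’s only meant to show the flavor of the two approaches.

```python
from collections import Counter


def rule_based(x):
    """Traditional programming: a hard-coded rule, "If X, then Y"."""
    if x == "q":
        return "u"
    return None


def most_likely_next(examples, x):
    """Learn from examples: return the character that most often follows x,
    plus the share of the time it did so."""
    followers = Counter()
    for word in examples:
        for current, nxt in zip(word, word[1:]):
            if current == x:
                followers[nxt] += 1
    if not followers:
        return None, 0.0
    best, count = followers.most_common(1)[0]
    return best, count / sum(followers.values())


# Made-up training examples. Three of the four follow "q" with "u".
examples = ["queen", "quilt", "quote", "qatar"]

print(rule_based("q"))                  # the rule always answers "u"
print(most_likely_next(examples, "q"))  # the guess is "u", seen 75% of the time
```

Both approaches answer “u,” and both would be wrong for “qatar.” The prediction is only the most likely answer based on what was seen before, which is not the same thing as the correct answer.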

With AI, one could interact and say, “Based on X, what do you think is next, and why?” The answer would be conversational, and would explain how the AI tool got to the result of Y. That doesn’t mean Y is accurate.

Before AI, consumers had never been in the driver’s seat, with the ability to query computers easily about anything with no programming needed – receiving conversational answers in their language of choice. Answers that are hopefully accurate.

Back in 2011, Siri became available, Amazon Alexa in 2014, and Google Assistant in 2016, but these were all command driven with a restricted vocabulary and could only perform limited actions.

In November 2022, ChatGPT introduced us to a new world, triggering the AI boom. By late 2023 and early 2024, suddenly the term AI, artificial intelligence, snowballed and was everywhere. The early versions of AI tools could only do a fraction of what they can in 2026, and could not perform tasks on your behalf.

ChatGPT prompt: “Make me a fun goofy picture with a cat that illustrates the ability of AI to make a fun goofy picture.”

Today that has all changed and it seems like everyone is making goofy pictures for fun.

Artificial Intelligence is NOT Intelligent

Let me say this loudly – artificial intelligence is not intelligent!

AI is a computer – electronic pulses in a data center somewhere. AI is trained on massive amounts of data, distills it in specific ways, and then, using various types of skills, interacts with humans in a helpful manner. “Helpful” depends on perspective.

This field, as a whole, is really still in its infancy. That’s both the bad news and the good news.

AI tools are “new,” exciting, and frightening all at once. AI has enormous potential, but it also creates opportunities for misuse, deception, and unintended consequences.

I’m not referring to water and electricity consumption and the impact of building thousands of data centers on the environment. I’ll let you decide for yourself on that one.

Risks include:

Frequent errors
GIGO
Results being presented overconfidently by the AI agent
Faulty results being believed by the consumer (that’s you and me) with the same level of overconfidence, and without verification
Social engineering – meaning the manipulation and influence of people by bad actors
Extremely dangerous, highly malicious manipulation and applications in ways not possible before

The entire AI landscape is complicated by a lack of public understanding and made even more challenging by the extraordinary pace of this technology’s evolution.

Multiple Types of AI

There are multiple types of AI, ranging from Machine Learning models to full-blown Generative AI that creates goofy cat images for you. For the most part, today, we’re talking about LLMs and Generative AI.

Large Language Models, called LLMs, are artificial intelligence tools, like ChatGPT or Claude, that are designed to process human-like text or speech and generate output in the same way. AI doesn’t just give you a list of resources that you evaluate yourself, like a search engine; it gives you an “answer” (such as it is), writes text, and has an interactive “conversation” with you.

How does that happen?

The AI tool at the data center aggregates and amalgamates data based on your input and its training, then predicts the words most likely to come next, in what context, and how those words relate to each other.

That’s how AI forms an “answer.”

This is how and why AI, specifically LLMs, can write essays on a topic and create entirely fictitious but highly engaging social media postings and stories that aren’t presented as “stories,” but as someone’s personal experiences, meaning as “truth.”

AI, or the people who generated that AI script, or both, present fictional results with great confidence, often beautifully, and far more convincingly than humans.

This is where it’s important to differentiate between the tool itself, and the “driver,” meaning the human that’s prompting the AI tool.

The driver needs to prompt AI correctly and verify the output.
AI, the tool itself, sometimes generates incorrect information, often regardless of the prompts provided by the driver.
Sometimes the AI tool performs exactly as instructed, but the driver requested something “improper.” By improper, I don’t mean inadvertently or by accident.
Sometimes the human is unethical.
AI isn’t a sentient being and doesn’t understand the difference.

The human decides what to do with AI-generated results. Many times, AI-generated text, recognizable by word patterns or other characteristics (today), is posted to social media as “original” or factual, and contains incorrect information.

This is often referred to as “AI slop,” which is one of the nicer terms, especially by those of us who increasingly find incorrect but convincing AI slop posted as “helpful information” and positioned as “expert,” even though it contains substantial inaccuracies.

Worse yet, very convincing AI slop can easily be generated to part you and your money.

And do I EVER have an example for you that combines AI slop and ethics.

AI SLOP and Ethics

Just two days after our new paper, on which I’m a co-author, Mitotree: The Universal Human Mitochondrial Reference Phylogeny at 10x the Resolution, was published, a company, whose name I’m not including because I don’t want to give it any oxygen or get it indexed with this article, posted a “beautiful” AI poster based on our paper – without our knowledge.

Looks nice, right?

To begin with, it appears for all the world like the authors provided this infographic, which we ABSOLUTELY DID NOT DO. Our names are right at the top. However, our names, as the paper’s authors, lend this “thing” credibility, thereby leveraging our work BOTH unethically and inaccurately.

This AI-generated infographic, although it’s not labeled as such, was created by a third party shortly after the publication of the Mitotree paper. While visually impressive, it contains several scientific inaccuracies, illustrating how quickly and easily authoritative-looking but incorrect content can be created and disseminated.

That’s one of the issues with AI – the beauty and professional appearance of AI-generated “things” encourages unwarranted confidence in the output, when the information is very wrong.

That’s why humans bear the responsibility of BOTH using AI ethically, AND verifying its accuracy. It’s also why, as consumers, we need to question everything.

My biggest issue with this situation isn’t with AI, other than the fact that it generated incorrect output – the issue is with the humans who intentionally created this, using AI. In other words, the drivers.

The infographic doesn’t say they created this incorrect rubbish, and I assure you, they never asked for permission. Then, they published the infographic on their own blog. In case you’re wondering, the company encourages uploads and charges people to get “new results.”

Now for the AI part.

The information IS WRONG and NOT a synthesis of what we published!!!! This infographic shows that all non-L haplogroups descend from haplogroup L4, which is absolutely FALSE.

Haplogroups M and N descend from haplogroup L3, and haplogroup R descends from a subclade of N. You can trust me because I’m one of the paper’s authors, or better yet, you can look for yourself, here, on Discover, or here, here, and here.

That isn’t the only thing that’s wrong, either, but how would normal air-breathing humans, meaning consumers, ever know?

Doesn’t that infographic look professional and convincing, especially if you, as a consumer, didn’t actually check everything on the document – AND its authenticity?

You’d assume legitimacy, right?

If you didn’t know, wouldn’t you be impressed with the expertise of the company that posted this infographic on their blog? And, as a normal consumer, how would you know?

You’d be impressed because you didn’t realize they hijacked someone else’s work, created this “beautiful” infographic, included the authors’ names on something inaccurate that the authors knew nothing about and didn’t endorse, and then published it. All without saying one word indicating that the infographic isn’t the authors’ work, was AI generated, or by whom.

In the past, before generating AI slop was this easy, consumers often presumed that a business was ethical and accurate. Of course that wasn’t always true, but being convincing at first glance is much easier today. Also, presume is related to assume…and we all know the rest of that story.

This is one of the dangerous sides of AI – illustrating how easy it is to deceive people now. It’s increasingly difficult to distinguish between legitimate expertise and fabricated authority. AI has removed that barrier.

You can no longer accept that anything is what it appears to be unless you’re working directly with known, trustworthy entities. The offending company completed that infographic in the click of a button and the blink of an eye, while I hadn’t even finished writing my own article about the paper’s release.

That company wants you to upload your DNA to them so that they can tell you “things” about your DNA. The intention is clear.

Of course, the consuming public, unless they were extremely vigilant, would never figure out either issue – ethics or accuracy.

I had to delete the next paragraph or two that I wrote on the topics of ethics, trust and confidence because I’m still so furious. Hot under the collar doesn’t even begin to describe how I feel about the ethics of misrepresenting something that we authors just spent six years of our lives on. Trust me when I tell you that my internal monologue was both very salty and rather spicy!😊

However, there’s good news. This infographic provides a perfect illustration of AI slop, how deceptively great it looks, the ethics surrounding AI usage, and how difficult AI is to discern.

In fact, I couldn’t have come up with a better “bad example.”

A six-fingered hand, misspelled words or three arms in an image are obvious, and are yesterday’s AI tipoffs.

A misrepresented phylogenetic relationship or an incorrect founder-clade example is not obvious. Only subject-matter experts would or could notice if they were focused and paying attention.

That’s the problem in a nutshell.

The infographic wasn’t obviously wrong. It was convincingly wrong.

And convincing wrongness is far more dangerous than ridiculous wrongness, like six fingers, because most readers never realize they’ve been misled. Or why.

This single example demonstrates several AI themes in one fell swoop:

AI-generated content
Ease of creating complex and convincing output
Apparent authority
Misplaced trust
Lack of topic expertise
Overconfidence
AI slop
Difficulty of discerning truth
Yesterday’s “AI clues” are gone now – like misspelled words
Marketing vs. science
The necessity of human review
The fact that human review is only effective when the reviewer actually understands the subject, and cares
Ethics

As with this example, AI slop is often interspersed with accurate information, and it’s impossible to tell the difference unless you actually DO DUE DILIGENCE AND VERIFY ALL OUTPUT.

Yes, all of it.

Don’t shoot the messenger!

Hallucinations

Next, let’s discuss genetic genealogy, particularly haplogroup information. Hallucination or hallucinating is the term used for when AI simply makes things up, which often sound extremely convincing.

There’s nothing AI can tell you about your haplogroup that reputable sources cannot – and AI can’t see behind paywalls or logins, or into your matches.

FamilyTreeDNA has an article in their help center titled, Why AI Models Struggle with Haplogroup Analysis.

Unfortunately, I encounter more and more instances where someone uploads their DNA to a third-party site, or “asks AI.” They receive a (sometimes substantially) incorrect haplogroup in a completely different part of the tree, complete with convincing language, post it publicly, and then decide to argue that the third-party site (which probably uses AI) or their AI tool is correct.

Let’s look at an example. The Native American Anzick-1 burial in Montana, which dates from roughly 12,500 years ago, belongs to mitochondrial DNA haplogroup D4h3a. There’s no dispute about that.

A tester uploaded their mitochondrial DNA to “AI” and was very confidently told that, based on their mutations, their results belonged to haplogroup A2ex. They don’t.

ChatGPT misinformation about Anzick-1 haplogroup

They were then informed that it was also Anzick’s haplogroup. Wrong again.

FamilyTreeDNA’s Discover tool comparing mitochondrial DNA haplogroups D4h3a and A2ex. Their common ancestor lived about 66,000 years ago.

Not only did AI report Anzick’s haplogroup incorrectly on a grandiose scale, but those two haplogroups haven’t shared a common ancestor for roughly 66,000 years. That common ancestor belonged to haplogroup L3 and lived in Africa. AI made a massive mistake.

But it gets worse.

ChatGPT incorrect information about haplogroup A2ex.

The AI “answer” continued for four pages, containing completely erroneous information. To begin with, A2ex is a haplogroup, and “ex” has never meant excluding.

That’s bizarre, and an example of AI making something up that is patently false, but sounds wonderful and very authoritative.

The term for this AI behavior is hallucinating. I’m not publishing the rest of this exchange because I don’t want anyone (or any AI bot), for one minute, to think any of it is accurate. AI even made up mutations, along with four pages of “fairy tale.”

The individual who received this information was so excited and proudly posted it, which in turn provided incorrect information for other consumers, and encouraged them to use a badly flawed tool. Then they proceeded to argue with the experts.

They were absolutely convinced because it “felt” true to them, and because they wanted to believe they had discovered something special, and were related to Anzick. Their comment was, “You’re wrong, because AI told me it was true, and I’ve learned a lot from AI.” I was quite exasperated, but also feel sorry for them and can’t help but wonder how much else of what they “learned” from AI is wrong too, but I digress.

Most AI errors aren’t obviously wrong to the consumer. If AI said that you were descended from Tyrannosaurus Rex, you’d laugh. But if it tells you something more plausible and sounds confident, it’s very easy to be convinced. These errors are so dangerous not because the experts are fooled, but because non-experts either can’t, don’t, won’t or don’t think they need to invest the time to discern the difference.

I find it a bit baffling that anyone would use AI, or worse yet, a pay site for haplogroup misinformation, especially since FamilyTreeDNA provides the Discover website with free reports for every haplogroup. They are the unquestioned industry phylogenetic experts for both Y-DNA and mitochondrial DNA, and literally created the reference model for all haplogroups with the Mitotree.

Everyone can use Discover to access both the Y-DNA tree and Mitotree – for free – here. Discover isn’t even behind a paywall, and every customer can click through from their results page.

As far as haplogroups are concerned, there’s really no reason to rely on AI-generated answers without verifying them, because the authoritative resources are freely available and incredibly easy to access.

FamilyTreeDNA’s Discover Ancient Connection for Anzick-1.

Regarding Anzick’s haplogroup, all I had to do was enter haplogroup D4h3a in Discover and under Ancient Connections, right there is Anzick’s information.

I may start posting a link to this article on every single post where someone starts out with, “I submitted my DNA (or haplogroup) to AI, and it said…”

Let me be very direct. Don’t believe AI when it has to do with genetic information, especially Y-DNA, mitochondrial DNA, and haplogroups. AI does not have the capability of understanding topology and nuances of phylogenetic trees, and can only parrot back what others have said – correctly or incorrectly.

Incorrect information that’s publicly posted is then fed back into the AI algorithm, further reinforcing incorrect results.

You can find the free Discover tool for both Y and mtDNA, here, and you can join FamilyTreeDNA’s Mitochondrial DNA Group, here, and the Big Y Group, here.

AI Training and AI at Work

AI is trained on massive datasets of mostly unknown origin, including all public postings such as Reddit and Facebook public groups, pages and postings.

In other words, AI is always accruing additional information, including data uploaded by users.

As genealogists, we are already aware of the dangers of unsourced trees and information that is repeated and copy/pasted without verification.

AI’s training provides more than just data points for you to evaluate, like trees.

AI bots are trained to interact in a humanlike manner. So instead of trees with hints, think hypothetically of an AI bot that reads the trees, then “creates” a wonderful story or infographic about your ancestor – that may or may not be either fully or partially accurate. But it’s beautiful, heartwarming and you love it! Plus, you don’t have to sort through all those trees, hints, and do the work yourself. AI did it for you! Win – win, right? Wrong.

AI knows how to very effectively manipulate language, images, and with them, emotion. Yours, to be specific. That’s both the bad news and the good news.

AI also has the ability to sift through large amounts of data and summarize succinctly – sometimes even correctly. Sometimes it takes several refinements to obtain something that’s both correct and what you want. AI can discern patterns in massive amounts of data that we cannot, at least not readily.

Think of AI as your not-so-trusty but very confident and friendly intern – and I don’t necessarily mean a college intern.

Remember, when you see AI-generated content published by others, their intern has been at work too.

AI itself is not a sentient being. It’s not inherently ethical or unethical. However, it has been trained to interact with you in a human way. It’s easy after tens of thousands of years of human conditioning for us to interpret AI as human.

Let me give you an example.

I use ChatGPT regularly and was having an interactive conversation after asking it a question. ChatGPT replied that it didn’t know, which is a substantial and startling improvement over earlier versions. I replied, “I’m one of the team members, and even I don’t know.” Really, there was no reason for me to say that, except we interact with our GPTs as human, sometimes even naming them. Then, ChatGPT said, “That made me laugh.”

I was a bit startled.

That made ME laugh, because AI is a machine. It can’t laugh, but it has been trained how to interact with us in a humanlike manner – often sycophantically. Remember how LLMs are trained. It knows what to say next. The smiley face was probably its “humor” clue. Making your interactions both useful and enjoyable keeps you paying your monthly subscription fee.

Remember that AI has no morals, because it’s a machine, and no ethics, for the same reason. That falls to the humans driving. If someone intentionally drives their car into a crowd, it’s not the car’s fault.

AI currently doesn’t have the ability to self-check or self-regulate, though this has improved somewhat in recent months and will, hopefully, continue to improve over time.

People who use AI can use the results for good, for nefarious purposes, or simply as a “time-saving” assistant. There are no guardrails. I could give you very ugly examples, but I’ll simply say that, if prompted, AI will generate the worst things you can imagine, including nonconsensual adult images of real people depicting events that never happened. These are generally called deepfakes, although deepfakes aren’t always generated in a negative context. I’ll discuss this phenomenon as part of Generative AI in the final article, where we’ll cover the dark side of AI.

Conversely, AI can be intended for good by its human “driver” but still be inaccurate and, consequently, unintentionally inflict damage or spread misinformation.

The Bottom Line

Here’s the bottom line.

Your personal threat level warning flag now needs to be permanently set to red.

You need to be increasingly vigilant, meaning actively suspicious, of absolutely everything, even exchanges that used to be safe. In other words, if you receive an email from an organization or government agency that you’ve interacted with in the past, don’t click on an embedded link just because you always have and it was safe then.

Hint: Go to the website directly. E-mails are very easy to spoof and your SS account password, for example, is invaluable to a hacker.

The bad guys have gotten really good at being horrible. AI is becoming more difficult to detect every day – even for those of us with a significant amount of experience.

I realize that I sound paranoid, but I just completed security update training, and the threat landscape is worse than I ever imagined. I’ll be sharing that information throughout these articles. Better paranoid and safe than trusting and sorry. What I’m striving for is an appropriate amount of alarm and a safe level of balance. I don’t want you to learn the hard way.

Today’s tip-offs that something is AI-generated will be gone tomorrow.

To use AI tools is to learn what AI output looks and feels like, so you can recognize when you encounter AI that you didn’t generate.

Now that we know what AI is, and isn’t, the next article will focus on AI Assistants, using AI successfully, and how to avoid pitfalls. You don’t want to be the president of the AI Fan Club, nor do you want to feel like you’re in an AI Escape Room.

_____________________________________________________________

Share the Love!

You’re always welcome to forward articles or links to friends and share on social media.

Subscribe!

If you haven’t already subscribed, it’s free. You’ll receive an e-mail whenever I publish by clicking the “follow” button at the top of the main blog page, here.

Help Keep This Blog Free

I receive a small commission when you click a vendor link in my articles and purchase that item. This does NOT increase your price but helps me keep the lights on and this informational blog free for everyone. Please click on the affiliate links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Uploads

FamilyTreeDNA – Y-DNA, mitochondrial and autosomal DNA testing
MyHeritage DNA – Autosomal DNA test
AncestryDNA – Autosomal DNA test
AncestryDNA Plus Traits
23andMe Ancestry – Autosomal DNA only, no Health
23andMe Ancestry Plus Health

Genealogy Products and Services

MyHeritage Subscription with Free Trial
Legacy Family Tree Webinars – Genealogy and DNA classes, subscription-based, some free
Legacy Family Tree Software – Genealogy software for your computer
OldNews – Old Newspapers with links to save to MyHeritage trees
MyHeritage Omni comprehensive “everything included” subscription plan
Newspapers.com – Search newspapers for your ancestors
NewspaperArchive – Search different newspapers for your ancestors

My Books

DNA for Native American Genealogy – by Roberta Estes, for those ordering the e-book from anyplace, or paperback within the United States
DNA for Native American Genealogy – for those ordering the paperback outside the US
The Complete Guide to FamilyTreeDNA – Y-DNA, Mitochondrial, Autosomal and X-DNA – for those ordering the e-book from anyplace, or paperback within the United States
The Complete Guide to FamilyTreeDNA – Y-DNA, Mitochondrial, Autosomal and X-DNA for those ordering the paperback from outside the US

Genealogy Books

Genealogical.com – Lots of wonderful genealogy research books
American Ancestors – Wonderful selection of genealogy books

Genealogy Research

Legacy Tree Genealogists – Professional genealogy research

MyHeritage Whole Genome Sequencing (WGS) Results and Comparison

Posted on February 19, 2026 by Roberta Estes

I’m excited to receive my low-pass whole-genome sequencing test results from MyHeritage. When MyHeritage initially introduced their new test, I wrote about what that means in the article, MyHeritage Introduces a Low-Pass Whole Genome Autosomal DNA Test and Why It Matters.

In that article, I said I was ordering a WGS test and would publish a comparison of the new test with the two tests I’ve previously taken with MyHeritage, plus a test I uploaded to MyHeritage from FamilyTreeDNA in 2016.

Before I review these comparative results with you, I want to properly set expectations.

What To Expect from the MyHeritage Whole Genome Sequence (WGS) Test

From Ran Snir, Vice President of Product Management for MyHeritage DNA:

Those who take a MyHeritage DNA test now are all sequenced with WGS and will receive the same access to features and results as those who have taken a MyHeritage DNA test and were genotyped with an old chip in the past. In fact, all samples processed by the lab in December were already processed with WGS.
The transition to WGS does not introduce new features and capabilities immediately.
The new WGS technology has minor implications when it comes to the ethnicity estimate results and DNA Matches, but people should not expect to get “something completely different”.
The transition to WGS and having more people processed with it opens the door for deeper research and more insights. It will allow MyHeritage to drastically improve its phasing, imputation and matching algorithms. This will take time as MyHeritage needs to amass a lot of data first. In the long run, MyHeritage plans to improve the product, build new features and introduce new capabilities which will be based on learnings from WGS.

Thank you, Ran.

In this article, I am not focusing on ethnicity, but on DNA matches, which I depend on to help me unravel those pesky genealogy puzzles.

Also, please note, some features I’m discussing here are free with the purchase of a DNA test, and others require a subscription at some level. I have a subscription, and I use it nearly every day.

Coupon Code for a $20 DNA Test

That said, if you already know you want to order the WGS test, or you’re a new tester, use this special coupon code at checkout to reduce the test to $20 through the end of February 2026. That’s a great value!

Coupon Code: RobertaFeb26

Now, let’s look at my results when comparing the WGS test to the results of my other three tests at MyHeritage.

How does the new WGS test fare?

My Results

Before ordering my WGS test from MyHeritage, I already had three tests at MyHeritage to choose from.

MyHeritage allows you to select between different tests, including uploads and tests you’ve taken at MyHeritage at different times. There’s absolutely no need to delete older tests there, and in fact, I recommend that you don’t. This article illustrates why.

My four tests include:

FamilyTreeDNA (FTDNA) test uploaded to MyHeritage in 2016
MyHeritage health test taken in 2019
MyHeritage test taken in June 2024
MyHeritage low-pass whole genome test (WGS) taken in December 2025

	FTDNA 2016	MH Health 2019	MH 2024	MH WGS
Total Matches	19,722	17,179	17,767	17,676
TOFR	128	111	108	Not ready

This chart shows the total number of matches and Theories of Family Relativity for each test in January 2026.

What Are Theories of Family Relativity (TOFR)?

I have several very useful Theories of Family Relativity (TOFR) where MyHeritage uses trees and other documentation, such as census records, to connect you and your DNA matches to common ancestors. TOFR is one of MyHeritage’s most beneficial tools.

In this example, my match only provided their father’s name, but that name was linked to our common ancestors by connecting through a FamilySearch tree. Often, multiple potential relationships and paths are shown. Like with any other tool, each theory needs to be reviewed for accuracy.

Please note that TOFR is only run periodically and has not yet been calculated for the WGS test results. I’m sure that will happen soon.

Evaluating Matches

I wanted to know if (and how) the same people matched me on the different tests, including the new low-pass whole genome (WGS).

Are there differences?
Are the differences slight or pronounced?
Do some people match me on some tests, and not others?
Do some people match me on earlier tests, but not the WGS?
Do some people match me on the WGS, but not earlier tests?
What is the takeaway from all of this?

To compare the results of all four tests, I created a side-by-side comparison spreadsheet.

The Spreadsheet

I created a spreadsheet where I recorded 434 individual matches by entering information in the following columns:

A – Match number that I assigned
B – Match source (more about this in a minute)
C – FTDNA 2016 test matching number of cMs
D – MH Health 2019 matching number of cMs
E – MH 2024 matching number of cMs
F – MH Low Pass Whole Genome Sequencing (WGS) matching number of cMs
G – Relationship if known
H – Common Ancestor if known

I included several other columns in my spreadsheet for my own genealogical research purposes that show my matches’ tree size, and the actual lineage from them to our common ancestor couple. However, for comparing matches and accuracy, I’ve utilized the columns indicated above.

Match Sources

I wanted to compare different types of matches, meaning not just the closest or the most distant, or only the matches I can identify. These are the sources of the matches I compared.

Cousin Finder – I actually started a spreadsheet back in October 2025 when I was using Cousin Finder to find cousins, meaning people with common ancestors identified by MyHeritage. Twenty-eight of the 378 people that MyHeritage identified as cousins are DNA matches, so those were the first matches I entered into this comparison spreadsheet, along with our most recent common ancestors.
TOFR – All Theories of Family Relativity begin with DNA matches, then connect you and your matches together using trees and/or documents, when possible. Because matches vary with each of the tests, so do the TOFRs. WGS theories aren’t yet calculated, but the matches are, so I’ve included TOFR matches here.
Family Kits – These 15 matches are family members’ tests that I manage and match, so I clearly know how we’re related.
Top 100/150 – The first group of matches, other than the above categories, were the top 100 matches using the FamilyTreeDNA 2016 kit, which was my first test at MyHeritage. All tests continue to accumulate matches over time, so it just made sense to start here.

However, after I finished transcribing each of those 100 matches into the spreadsheet and started transcribing the top 100 matches for the MyHeritage 2019 test, I quickly realized that the top 100 matches were not the same between tests. Therefore, I used the top 100 matches from all 4 tests. For every name included from any test in the top 100, I included the matching cM amount from all four tests. This means that in total, there are more than 100 in the “Top 100”, so now it’s called the Top 100/150, but all of the top 100 matches from each of the four tests are included in the spreadsheet. In total, there are about 220 in that category.

Bottom 100 – Last, I included the bottom 100 matches on the FTDNA 2016 kit, meaning I listed those and searched for them on the other tests. If I had included the bottom 100 from all four tests, it would have been more like the bottom 350.

When I finished listing all of these matches, I had 434 to work with for this comparison.

Minimum Matching

The minimum MyHeritage reported match is 8 cM, and at that level, a surprising number of tests don’t match either parent, although some clearly match with close relatives on that parent’s side, which means that either:

Those tests (either mine or the match’s, or both) were uploaded and imputed
Some portion of the parents’ test did not read
These are not valid matches, meaning they are identical by chance, not by descent.

About Imputation

Imputation is a widely used technology among vendors to bridge small sections of unread DNA. This is useful when comparing files from different vendors for matching.

Vendors use imputation internally too.

For example, vendors often use different DNA chips in the lab. They sometimes change chips internally, as well, for a variety of reasons. Regardless of why, the same locations aren’t always read, or aren’t read successfully. Imputation levels the playing field, allowing backwards compatibility, and compatibility for matching across platforms. Imputation fills in the blanks to equalize those files, allowing them to be compared for matching.

Let me give you an example. Let’s say you have the letters c_t, where the middle letter between c and t is missing. In English, there are a limited number of letters that can be. To begin with, it must be a vowel. In this case, it has to be either a, o or u. Next, looking at context, if the surrounding words are “the c_t chased a mouse,” the missing word is not cut or cot. It’s almost certainly cat, so the “a” is filled in using imputation.
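If you like to see ideas in code, here is a toy Python sketch of the c_t example. It only illustrates the fill-in-the-blank idea and is not how any vendor actually imputes DNA. The tiny REFERENCE list and the impute_word function are made up for this illustration.

```python
import re

# A tiny, made-up set of reference sentences (an assumption for this example).
REFERENCE = [
    "the cat chased a mouse",
    "a cat chased the bird",
    "she cut the bread",
    "he slept on a cot",
]


def impute_word(pattern, context_words):
    """Fill in the blank ("_") in pattern using the reference sentences.

    Each candidate word scores one point for every context word it
    shares a reference sentence with. The highest score wins.
    """
    regex = re.compile("^" + pattern.replace("_", "[a-z]") + "$")
    scores = {}
    for sentence in REFERENCE:
        words = sentence.split()
        shared = len(set(words) & set(context_words))
        for word in words:
            if regex.match(word):
                scores[word] = scores.get(word, 0) + shared
    return max(scores, key=scores.get) if scores else None


best = impute_word("c_t", ["the", "chased", "a", "mouse"])
print(best)
```

The candidates cat, cut and cot all fit the pattern, but only cat appears alongside the surrounding words in the reference sentences, so it gets the highest score.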

Imputation usually works well, but occasionally it can extend matching areas improperly. This has always been true, and it’s still true with the new low-pass WGS test. The new WGS test only scans the genome twice to keep the test affordable. Any “no read” area must be imputed. I wrote about imputation here.

Ok, back to the MyHeritage comparison!

Test Comparison Methodology

If you’re recreating this process with your own results:

Color-code the column headers for the various tests
Label them clearly so you can easily differentiate between tests
Freeze your top row

Select the test you want to search for matches, and record the people you want to cross-check. I began the process with my FTDNA test that I uploaded to MyHeritage in 2016.

I entered the matches on my spreadsheet, recording the matching cM amount. Then I selected the other tests, one by one, and searched for the same match name.

In this case, I started with the FTDNA 2016 test. Jane Jones (not her real name) matched me at 744 cM.

Then I selected the MyHeritage 2019 test, searched for Jane’s name, and recorded the match amount – 739 cM. I did the same with the 2024 test, and last, the WGS test.

When searching by surname at MyHeritage, don’t always expect the person to be at the top of the list where you might expect. Be sure to scroll down a bit, even to page two, especially with common names. MyHeritage also displays people with the same surname in their trees.

Match Analysis

As we work through these match results, keep in mind that the comparison percentage numbers only pertain to the 434 people that I’ve selected to compare across all four tests. This is NOT the total amount in any category for all of my matches. There’s no way to make that determination without manually comparing every single match for all four tests – which is why I selected what I felt was a representative sample.

You’ll quickly discover that many people DON’T MATCH you on all the DNA tests. You’ll notice as I give examples that I’ve color-coded some cells for my own use in both interpreting matches as well as sorting them. For example, people who don’t match on that test were labeled “none” and colored bright blue. Eventually, I simply entered “0” instead of the word “none” so I could perform math functions on those cells. I retained the blue so I could filter by cell color. You get the idea.
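If you keep your comparison in a file and work with it in Python rather than in a spreadsheet, the same “none” to 0 cleanup might look something like this. This is a minimal sketch. The to_cm function and the sample row are hypothetical, not taken from my spreadsheet.

```python
def to_cm(cell):
    """Convert a spreadsheet cell to a number, treating "none" or blank as 0."""
    text = str(cell).strip().lower()
    if text in ("", "none"):
        return 0.0
    return float(text)


# Hypothetical row: one match's cM values on the four tests.
row = ["52", "none", "50", ""]
values = [to_cm(cell) for cell in row]

# With numbers in every cell, math works across the whole row.
tests_matched = sum(1 for v in values if v > 0)
print(values, tests_matched)
```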

Using the new WGS test, 16 people (3.7% of 434) match me ONLY on the WGS test, but do NOT match me on any of the other tests.

Interestingly enough, they are all in the Top 100/150 category for the WGS test. Those match results range from 45 cMs to 53 cMs.

That’s NOT a trivial amount of DNA. It’s rather confusing how someone could match at that level on the WGS test, but not at all on the others.

Equally interesting is that two of those 16 WGS matches don’t match either of my parents.

So, let’s say this another way to be clear – I only see these matches on the WGS test, and none of the other tests.

How Many People Match Me on Only One Test?

Ok, so how many people match me on ONLY one test?

FTDNA 2016 Only Matches	MH 2019 Only Matches	MH 2024 Only Matches	MH WGS Only Matches
44 (10.1%)	3 (0.7%)	3 (0.7%)	16 (3.7%)

44 people match me ONLY on the FamilyTreeDNA 2016 uploaded test.
3 people match me ONLY on the MyHeritage 2019 and 2024 tests, respectively, but not the same three people
16 people match me ONLY on the MyHeritage WGS test

Extrapolating these percentages to the rest of my matches suggests the following number of people would match ONLY on this test in the entire match list for each test.

	FTDNA 2016	MH Health 2019	MH 2024	MH WGS
Total Matches	19,722	17,179	17,767	17,676
Extrapolated Matches on Only This Test	10.1% or 1,992 matches	0.7% or 120 matches	0.7% or 124 matches	3.7% or 654 matches
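For anyone who wants to check the arithmetic or repeat it with their own numbers, here is a minimal Python sketch of the extrapolation. The counts and totals come from the tables above. The extrapolate function is a name I made up, and it assumes the percentage is rounded to one decimal place before being applied to the total.

```python
SAMPLE_SIZE = 434  # people compared across all four tests

total_matches = {
    "FTDNA 2016": 19722,
    "MH Health 2019": 17179,
    "MH 2024": 17767,
    "MH WGS": 17676,
}
only_this_test = {
    "FTDNA 2016": 44,
    "MH Health 2019": 3,
    "MH 2024": 3,
    "MH WGS": 16,
}


def extrapolate(count, sample_size, total):
    """Return (percent of sample, estimated matches in the full list)."""
    pct = round(count / sample_size * 100, 1)
    return pct, round(pct / 100 * total)


results = {
    test: extrapolate(only_this_test[test], SAMPLE_SIZE, total_matches[test])
    for test in total_matches
}
for test, (pct, estimate) in results.items():
    print(f"{test}: {pct}% of the sample, about {estimate:,} matches")
```

Keep in mind that an extrapolation like this is only as good as the sample behind it, so treat the results as rough estimates.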

How Many People DON’T Match Me on a Specific Test?

Now, how many people DON’T match me on a specific test?

No FTDNA Match	No MH 2019 Match	No MH 2024 Match	No MH WGS Match
36 (8.3%)	117 (27%)	126 (29%)	96 (22%)

36 people don’t match me on the FamilyTreeDNA test, but do match me on at least one other test at MyHeritage
117 people don’t match on the 2019 MyHeritage test, but do match on at least one other test at MyHeritage
126 people don’t match on the 2024 MyHeritage test, but do match on at least one other test at MyHeritage
96 people don’t match me on the WGS test, but do match me on at least one other test at MyHeritage

Extrapolating these percentages to my full match lists suggests how many people are missing from each test’s match list, but do match me on at least one other test.

	FTDNA 2016	MH Health 2019	MH 2024	MH WGS
Total Matches	19,722	17,179	17,767	17,676
Extrapolated # That Don’t Match on This Test	8.3% or 1,637 matches	27% or 4,638 matches	29% or 5,152 matches	22% or 3,889 matches

How Many Match Me On All Tests

195 matches, or 44.9%, nearly half of my matches, match on all four tests at some level.
Out of those, 68, or 15.7% of the total number of matches match me at exactly the same cM level across all 4 tests. That’s pretty remarkable.
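If your comparison lives in a file instead of a spreadsheet, the counts in the last three sections can be tallied in one pass. The following Python sketch shows one possible way to do it. The five rows are hypothetical placeholders, not my actual matches, and summarize is a made-up function name.

```python
TESTS = ["FTDNA 2016", "MH Health 2019", "MH 2024", "MH WGS"]

# Hypothetical rows: match name -> shared cM on each test, in TESTS order.
# 0 means no match on that test.
rows = {
    "Match A": [64, 0, 0, 0],
    "Match B": [0, 0, 0, 50],
    "Match C": [46, 0, 57, 57],
    "Match D": [22, 22, 22, 22],
    "Match E": [31, 29, 30, 33],
}


def summarize(rows):
    only_on = {test: 0 for test in TESTS}     # matches on this test only
    missing_on = {test: 0 for test in TESTS}  # missing here, found elsewhere
    on_all = 0                                # matches on all four tests
    identical = 0                             # same cM value on all four
    for cms in rows.values():
        matched = [test for test, cm in zip(TESTS, cms) if cm > 0]
        if len(matched) == 1:
            only_on[matched[0]] += 1
        for test, cm in zip(TESTS, cms):
            if cm == 0 and matched:
                missing_on[test] += 1
        if len(matched) == len(TESTS):
            on_all += 1
            if len(set(cms)) == 1:
                identical += 1
    return only_on, missing_on, on_all, identical


only_on, missing_on, on_all, identical = summarize(rows)
print(only_on, missing_on, on_all, identical)
```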

The Largest Differences Between Tests

Another question might be how large the difference is between the various matches.

I calculated the largest differences between the highest and lowest match values between the four tests, and placed that value in column G. This means that I subtracted the lowest value of the four tests on this particular match, from the highest value.

In the first row, that means I subtracted 0, the MH 2019 test value, from 75, the WGS test value. The difference between the lowest and highest values is 75 cMs.

Next, I sorted, highest to lowest in column G, so the largest difference is displayed at the top.

I was VERY surprised to see a difference as high as 75 cM, so let’s evaluate the results where the difference is 50 cM or greater. Thirteen matches fall into this category.

Entry 168 – The largest difference at 75 cM. This person matches me at 70, 74 and 75 cM, but not at all on the 2019 test, which caused me to go back and check again. Did I spell the name correctly? Yes, I did. We don’t know why I don’t match this person on the 2019 test, but the other matching cM values are very close so they look to be correct.
Entry 183 – I match this person on both the FamilyTreeDNA uploaded test at 71 cMs, and the WGS test at 35 cMs, around half as much on the WGS test as the FamilyTreeDNA test. I don’t match them at all on either the MyHeritage 2019 or 2024 kits. I have no explanation.
Entry 186 – Like entry 168, we match on three of four tests at 62, 63 and 67 cMs, with the non-matching test being the 2019 test. I would presume that this match is accurate as well.
Entry 191 – This one is interesting because I match this person on the FamilyTreeDNA uploaded test at 64 cMs, but none of the other tests.
Entry 192 – We match at 57 cMs on both the WGS and the 2024 tests, but not the 2019 test, where we don’t match at all. The match on the FamilyTreeDNA test is 11 cMs lower, at 46 cMs.
Entry 228 – This person is my half 1C1R, and I match them on all the tests, of course. However, there’s a 53 cM difference between the WGS and the FamilyTreeDNA uploaded test. In a relationship this close, 53 cM is a small percentage and won’t affect matching, but it’s not an insignificant amount of DNA.
Entries 233, 247, 248 and 283 – I match these people ONLY on the WGS test at 50, 52 and 53 cMs, so if I hadn’t taken the WGS test, I wouldn’t match them at all. Without additional research, we can’t tell if these are legitimate matches or not, but 50-53 cM would be a lot to be imputed or to be identical by chance. These people also match one of my parents’ tests, which eliminates the identical by chance possibility on my end. A match that is identical by chance on my end would mean that some of the matching DNA came from my mother and some from my father. We can’t determine if these matches are identical by chance on their side. I’ve never seen a 50+ cM segment (or even close) that is identical by chance, though.
Entry 249 – Matches on the FamilyTreeDNA test at 15 cM, and the MyHeritage 2024 test at 52 cM, but not the others.
Entry 250 – Matches at 51 cM on the FamilyTreeDNA test, but not on any of the MyHeritage tests.
Entry 279 – Matches only on the MyHeritage 2019 test at 50 cM.
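The highest-minus-lowest calculation and the sort are easy to reproduce outside of a spreadsheet. Here is a minimal Python sketch using only the entries above where the value on each of the four tests is spelled out. The largest_difference function name is my own.

```python
# cM values from the entries above, in the order
# FTDNA 2016, MH 2019, MH 2024, MH WGS. 0 means no match on that test.
entries = {
    183: [71, 0, 0, 35],
    191: [64, 0, 0, 0],
    192: [46, 0, 57, 57],
    249: [15, 0, 52, 0],
    250: [51, 0, 0, 0],
    279: [0, 50, 0, 0],
}


def largest_difference(cms):
    """Highest value minus lowest value across the four tests."""
    return max(cms) - min(cms)


# Sort highest to lowest so the largest difference is listed first.
ranked = sorted(
    entries.items(),
    key=lambda item: largest_difference(item[1]),
    reverse=True,
)
for entry, cms in ranked:
    print(f"Entry {entry}: {largest_difference(cms)} cM difference")
```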

Difference Range

Next, let’s review the entire range of differences, meaning the largest matching difference for any one person across all four tests, by group. I’m including all 434 here so you can judge for yourself.

50-75 cM difference – 13 matches analyzed above

45-49 cM difference – 22 matches

40-44 cM difference – 11 matches

35-39 cM difference – 9 matches

30-34 cM difference – 10 matches

25-29 cM difference – 12 matches

20-24 cM difference – 26 matches

16-19 cM difference – 28 matches

13-15 cM difference – 29 matches

9-12 cM difference – 29 matches

8 cM difference – 78 matches

The 8 cM difference has the most of any value or category because this is the lowest level of matching at MyHeritage. Many people match at the minimum level on one or more tests, and not at all on the others.

6-7 cM match difference – 41 matches

The match differences at 5 cM and below are inconsequential. 57 matches fall into this category.
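Grouping the differences into the ranges listed above can be sketched in Python as follows. The ranges are copied from this section. The sample differences are hypothetical, the function names are made up, and the sketch assumes whole-number cM values no larger than 75.

```python
from collections import Counter

# (low, high, label) ranges copied from the list above, largest first.
BINS = [
    (50, 75, "50-75 cM"),
    (45, 49, "45-49 cM"),
    (40, 44, "40-44 cM"),
    (35, 39, "35-39 cM"),
    (30, 34, "30-34 cM"),
    (25, 29, "25-29 cM"),
    (20, 24, "20-24 cM"),
    (16, 19, "16-19 cM"),
    (13, 15, "13-15 cM"),
    (9, 12, "9-12 cM"),
    (8, 8, "8 cM"),
    (6, 7, "6-7 cM"),
    (0, 5, "5 cM and below"),
]


def bin_for(diff):
    """Return the label of the range that a whole-number difference falls in."""
    for low, high, label in BINS:
        if low <= diff <= high:
            return label
    raise ValueError(f"No range defined for a difference of {diff}")


def count_by_bin(diffs):
    """Count how many differences fall in each range."""
    return Counter(bin_for(diff) for diff in diffs)


# Hypothetical differences for a handful of matches.
counts = count_by_bin([75, 71, 47, 8, 8, 5, 0])
for _, _, label in BINS:
    if counts[label]:
        print(f"{label} difference: {counts[label]} matches")
```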

Commentary

One of the indicators of a valid match is if a parent has tested and also matches.

Of these 434 matches, 35 match neither parent, and most of those are at the smallest match level, meaning 8 cM. Of all the match amounts, that would be the least reliable, and most likely to be a false positive match, or identical by chance.

However, that’s not universally the case. Some WGS matches are at significantly higher levels, but don’t match either parent. Two people match me on the WGS test at 47 and 49 cMs, respectively, and not on any of the other tests. Neither of those two matches either parent.

After reviewing all 434 selected matches, it appears that both the FamilyTreeDNA 2016 test, and the WGS test produce the most consistent and reliable results of the four tests.

44 people, or 10%, match on BOTH the WGS and the FTDNA tests, but neither of the other two tests. A total of 13.8% match EITHER the FTDNA test OR the WGS test, but not the others.

Conclusions

I think we can draw several conclusions from this comparison.

First, let’s evaluate the number of matches. The difference in the total number of matches between the various tests, especially the three MyHeritage tests taken over time, isn’t that great. That’s exactly why you can’t depend on these numbers as an accurate comparison.

	FTDNA 2016	MH Health 2019	MH 2024	MH WGS
Total Matches	19,722	17,179	17,767	17,676

There are only a few hundred differences between the three MyHeritage tests, and about 2000 between the FamilyTreeDNA test uploaded in 2016 and the various MyHeritage tests. That’s a substantial difference.

The difference in the number of matches between tests may seem irrelevant, especially among the MyHeritage tests, until you realize that those who match AREN’T ALL THE SAME PEOPLE. In other words, comparing the MyHeritage 2024 test with the WGS test only shows a difference of 91 matches. This DOES NOT mean that the MyHeritage 2024 test and the WGS test have 17,676 of the same people who match both tests, and that the 2024 test simply has 91 more matches than the WGS test.

As we’ve seen, many people who appear on any one match list don’t appear on other match lists.
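To see why a small difference in list size says very little about who is on each list, consider this toy Python sketch. The two match lists and the names in them are hypothetical placeholders.

```python
# Hypothetical match lists. The names are placeholders.
mh_2024 = {"Ann", "Bob", "Cara", "Dev", "Eli"}
mh_wgs = {"Ann", "Bob", "Fay", "Gus"}

size_difference = len(mh_2024) - len(mh_wgs)  # the lists differ by only 1
shared = mh_2024 & mh_wgs                     # people on both lists
only_2024 = mh_2024 - mh_wgs                  # people on the 2024 list only
only_wgs = mh_wgs - mh_2024                   # people on the WGS list only

print(f"Difference in list size: {size_difference}")
print(f"On both lists: {len(shared)}")
print(f"Only on 2024: {len(only_2024)}, only on WGS: {len(only_wgs)}")
```

In this made-up example, the lists differ in size by just one person, yet only two people appear on both.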

Our analysis showed that 44.9% of my 434 matches compared appear on all match lists, which means that more than half of my matches appear on one or more match lists, and not the others. Therefore, just comparing the number of matches isn’t really relevant. You need to compare the people included on all the different tests, which is why I created my spreadsheet and included people from a wide variety of sources.

3.7% of my matches on the WGS test were not on any other test, which extrapolates to approximately 654 of my total WGS matches that I wouldn’t receive any other way.

I care a great deal about those matches, especially since at least some appear to be high value.

Yes, I absolutely, positively want those matches, especially when you consider that some of the matching differences are as high as 75 cM. A 75 cM match can be in the second, third or fourth cousin range.

Realistically, they may or may not be valid or useful matches – but if I don’t have the opportunity to compare them, I’ll never know.

Should You Purchase the WGS Test If You’ve Already Tested?

So, now for the question you’re surely asking yourself.

Truthfully, when I ordered my test back in December, I was ambivalent. I only ordered it to do this comparison for my blog readers – and I really dislike spending money on something that I don’t think will benefit me.

Note the words “don’t think.”

I’ve changed my mind, for several reasons, and I’m glad I ordered the test.

The thing that changed my mind was that I received a nontrivial number of matches on the WGS test that I didn’t receive on any of the others – even if some of them turn out to be identical by chance.

Since we can’t go back in time and take the earlier tests, and MyHeritage no longer accepts uploads from other vendors, our decision now is whether or not we should take the new WGS test, especially if we already have a DNA test at MyHeritage.

If you’re a new tester, by all means, test at all four of the main vendors. DNA matching is the best thing since sliced bread.

However, the people I’m really speaking to here are those who already have a test of some sort at MyHeritage.

Here’s the bottom line:

You will receive WGS matches that you didn’t receive on your other test – and vice versa, so don’t delete your older test at MyHeritage
Some of those new WGS matches may well be high-value matches – as was illustrated in the “differences” I discovered.
Given the differences in who is included in the match list, your TOFR will be different too – perhaps leading to a brick wall breakthrough. I have two that I’m just itching to solve.
Use matches, shared matches and TOFR from ALL of your tests at MyHeritage.

My Biggest Regret

As Ran Snir said, new features and developments at MyHeritage will be based on the WGS test. We don’t know what those developments might be, or when they will become available. But it’s very clear that while testers on the older testing platforms will receive as much as MyHeritage can give them, the MyHeritage DNA future is being built on the WGS platform. I want to be there and benefit from new discoveries.

My biggest regret is that my parents aren’t around to take the new WGS test – and neither are several other family members.

Of the 15 family members whose tests I manage at MyHeritage, 9 are deceased, and I think that four more are as well. Two others are now quite elderly and are no longer able to consent or retest.

Your closest family members are your DNA anchors, identifying lineages and pointing you in specific directions, guiding your research.

The very best thing you can do for your genetic genealogy is to test your grandparents if they are living, and your parents. If they aren’t available, test your closest relatives such as siblings, aunts, uncles and first cousins.

Preparing for the Future

So, here’s my advice:

Take the WGS test yourself in order to glean as much information as possible and to benefit from future developments.
Retest any relatives whose tests you manage on the WGS platform, if possible.
Test your close family members and anyone you know whose DNA test could help you identify ancestral lineages.

Why is testing your relatives important?

Close relatives will carry some of the DNA from your mutual ancestors that you don’t.

Having the known DNA of your ancestors means that you can evaluate and analyze the trees of the entire group of people who match those identified DNA segments to see if you can break down an upstream brick wall.

I’ve been successful doing this for some time – and am in the process again by combining DNA matches and traditional records research.

Coupon Code for $20 DNA Test

MyHeritage has been kind enough to provide a limited-time coupon code (RobertaFeb26) for my readers which DROPS YOUR PRICE for the DNA test to $20 through February 28th at midnight.

This is the absolute lowest price I’ve ever seen for a DNA test.

You’ll receive the following features that are included with every test:

Ethnicity and ethnicity map
DNA matches and the ability to contact them
Shared ancestral surnames
Chromosome browser
cM Explainer

In addition, with this code you’ll receive both Shared DNA Matches and Shared Ancestral Places that usually require a subscription.

Normally, a subscription is required to access:

Trees of DNA matches
Shared DNA Matches (free now with the coupon code)
Shared ancestral places (free now with the coupon code)
AutoClusters
Theory of Family Relativity (TOFR)

If you’re interested in trying a subscription, click here to purchase a MyHeritage subscription with a free trial.

Here’s the link to purchase the DNA test, and here’s the coupon code to enter at checkout: RobertaFeb26

And yes, absolutely feel free to share the coupon code with your family, friends, and anyone else who might benefit.

Let me know how your results compare when you receive them.

Great News – Both e-Pub and Print Version of “The Complete Guide to FamilyTreeDNA” Now Available Worldwide

Posted on June 11, 2024 by Roberta Estes

Anyone, anyplace, can order the full-color, searchable, e-pub version of The Complete Guide to FamilyTreeDNA – Y-DNA, Mitochondrial, Autosomal and X-DNA from the publisher, Genealogical.com, here.
Customers within the US can order the black and white print book from the publisher, here.
Customers outside the US can order the print book from their country’s Amazon website. The publisher does not ship print books outside the US due to customs, shipping costs, and associated delays. They arranged to have the book printed by an international printer so that it can be shipped directly to Amazon for order fulfillment without international customers incurring additional expenses and delays. If you ordered the book previously from Amazon and a long delivery time was projected, that should be resolved now and your book should be arriving soon.

Comprehensive

This book is truly comprehensive and includes:

247 pages
More than 267 images
288 footnotes
12 charts
68 tips
Plus, an 18-page glossary

To view the table of contents, click here. To order, click here.

Thank you, everyone, for your patience and your support.

Complete Guide to FamilyTreeDNA Released in Hardcopy

Posted on May 26, 2024 by Roberta Estes

Just what many of you have been waiting for! The hardcopy print version of the Complete Guide to FamilyTreeDNA has just been released.

The e-pub version was previously released and is available to worldwide customers only from the publisher. Now, the paperback print version is available too.
Click here to order the print version from the publisher in the US.
International customers must order the printed book from their country’s Amazon website to avoid delays, customs, and increased shipping costs.

As shown in the table of contents below, The Complete Guide to FamilyTreeDNA contains lots of logically organized information! It includes basic education about genetic genealogy and how it works, instructions on using the FamilyTreeDNA tests and tools, plus an extensive glossary.

Enjoy!

Announcing: The Complete Guide to FamilyTreeDNA; Y-DNA, Mitochondrial, Autosomal and X-DNA

Posted on May 4, 2024 by Roberta Estes

I’m so very pleased to announce the publication of my new book, The Complete Guide to FamilyTreeDNA – Y-DNA, Mitochondrial, Autosomal and X-DNA.

For the first time, the publisher, Genealogical.com, is making the full-color, searchable e-book version available before the hardcopy print version, here. The e-book version can be read using your favorite e-book reader such as Kindle or iBooks.

Update: The hardcopy version was released at the end of May and is available from the publisher in the US and from Amazon internationally.

This book covers more than how to use the FamilyTreeDNA products and interpret their genealogical meaning. It’s also a primer on the four different types of DNA used for genealogy and how they work:

Autosomal DNA
Mitochondrial DNA
Y-DNA
X-DNA

There’s a LOT here, as shown by the table of contents.

This book is chock-full of great information in one place. As an added bonus, the DNA glossary is 18 pages long.

I really hope you enjoy my new book, in whatever format you prefer.

Comparing DNA Results – Different Tests at the Same Testing Company

Posted on May 18, 2023 by Roberta Estes

Several people have asked about different tests at the same DNA testing company. They wondered if matching is affected, meaning whether your matches are different if you have two different tests at the same company. Specifically, they asked if you are better off purchasing a test AT a DNA testing vendor that allows uploads, rather than uploading a test from a different vendor. Does it make a difference to the tester or their matches? Do they have the same matches?

These are great questions, and the answer isn’t conclusive. It varies based on several factors.

Having multiple tests at the same DNA testing company can occur in three ways:

The same person tests twice at the same DNA testing company.
The same person tests once at the DNA testing company and uploads a test from a different testing company. Only two of the primary four DNA testing companies accept uploads from other vendors – FamilyTreeDNA and MyHeritage.
The same person uploads two different files from other DNA testing companies to the DNA testing company in question. For example, the DNA company could be FamilyTreeDNA and the two uploaded DNA files could be from any two of MyHeritage, 23andMe, or Ancestry.

All DNA testing companies allow users to download their raw DNA data files. This enables the tester to upload their DNA file to the vendors who accept uploaded files. Both FamilyTreeDNA and MyHeritage provide matching for free, but advanced tools require a small unlock fee of $19 and $29, respectively.

Testing Company	Accepts Uploads from Other Companies	Download/Upload Instructions
23andMe	No	Instructions here
Ancestry	No	Instructions here
FamilyTreeDNA	Yes, some	Instructions here
MyHeritage	Yes, some	Instructions here

I wrote about developing a DNA testing and transfer/upload strategy, here, and about which companies accept which tests, here.

Not all DNA files are created equal. For various reasons, not every vendor’s file is compatible with every other vendor.

Multiple Tests at the Same DNA Testing Company

I have at least two tests at each of the four major vendors. I did this for research purposes, meaning to write articles to share with you.

If you actually test twice at a vendor, meaning purchase two separate tests and take them yourself, you will have two test results at that testing company. At some companies, specifically 23andMe, if you purchase a new test through their “upgrade” procedure, you won’t have two tests, just the newer one.

However, if you’re testing at the DNA testing company, and also uploading, I generally don’t recommend more than one test at each vendor. All it really does is clog up people’s match lists with little or no additional benefit. At 23andMe, with their restrictions on the size of your match list, if everyone had two tests, the effective match limit would be half of their stated limit of about 1500 matches for earlier testers and about 5000 for current testers with subscriptions.

So, in essence, I’m telling you to “do as I say, not as I do.” We all have better things to do with our money than pay for the same test twice. If you haven’t tested your Y-DNA or mitochondrial DNA, that’s much more beneficial than two autosomal tests at one vendor.

Chips and Chip Evolution

Before we begin the side-by-side comparison, let’s briefly discuss DNA testing chips and how they work.

Each DNA testing company purchases DNA processing equipment. Illumina is the big dog in this arena. Illumina defines the capacity and structure of each chip. In part, how the testing companies use that capacity, or space on each chip, is up to each company. This means that the different testing companies test many of the same autosomal DNA SNP locations, but not all of the same locations.

Furthermore, the individual testing companies can specify a number of “other” locations to be included on their chip, up to the chip maximum size limit. The testing companies who offer Y-DNA or mitochondrial DNA haplogroups from autosomal tests use part of their chip array space for selected known haplogroup-defining SNP locations. This does NOT mean that Y-DNA or mitochondrial DNA is autosomal, just that the testing company used part of their chip array space to target these SNPs in your genome. Of course, for your most refined haplogroup and Y-DNA or mitochondrial DNA matching, you have to take those specific tests at FamilyTreeDNA.

This means that each testing company includes and reports many of the same, but also some different SNP locations when they scan your DNA.

In the lab, after your DNA is extracted from either your saliva or the cheek swab, it’s placed on this array chip which is then placed in the processing equipment.

There are several steps in processing your DNA. Each DNA location specified on the chip is scanned and read multiple times, and the results are recorded. The final output is the raw DNA results file that you see if/when you download your raw DNA file.

Here’s an example from my file. The RSID is the reference SNP cluster ID which is the naming convention used for specific SNPs. It’s not relevant to you, but it is to the lab, along with the chromosome number and position, which is in essence the address on the chromosome.

In the Result column, your file reports one nucleotide (T, A, C or G) that you inherited from each parent at each tested position. They are not listed in “parent order” because your DNA is not organized in that fashion. There’s no way for the lab to know which nucleotide came from which parent, unless they are the same, of course. You can read about nucleotides, here.
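For readers who want to look inside their own download, here is a minimal Python sketch of reading a file laid out like the one described above. It assumes a tab-separated file with four columns (RSID, chromosome, position, result) and comment lines that begin with "#". That layout is an assumption for illustration only. Each vendor’s actual format differs, and the sample rows are made up.

```python
import csv
import io

# Made-up rows in an assumed tab-separated layout. Real vendor files differ.
SAMPLE = (
    "# hypothetical raw data export\n"
    "rs0000001\t1\t10000\tAA\n"
    "rs0000002\t1\t20000\tCT\n"
    "rs0000003\t2\t15000\tGG\n"
)


def read_raw_dna(text):
    """Return {rsid: (chromosome, position, result)} parsed from raw-file text."""
    data_lines = [line for line in io.StringIO(text) if not line.startswith("#")]
    snps = {}
    for row in csv.reader(data_lines, delimiter="\t"):
        if len(row) < 4:
            continue  # skip blank or malformed rows
        rsid, chromosome, position, result = row[:4]
        snps[rsid] = (chromosome, int(position), result)
    return snps


def same_result(a, b):
    """Compare two results ignoring order, since the file has no 'parent order'."""
    return sorted(a) == sorted(b)


snps = read_raw_dna(SAMPLE)
print(len(snps), "positions read")
print(same_result("CT", "TC"))
```

Because the two nucleotides are not listed in parent order, the comparison treats "CT" and "TC" as the same result.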

When you upload your raw DNA file to a different DNA testing company (vendor), they have to work with a file that isn’t entirely compatible with the files they generate, or the other files uploaded from other DNA testing companies.

In addition to dealing with different file formats and contents from multiple DNA vendors, companies change their own chips and file structure from time to time. In some cases, it’s a forced change by the chip manufacturer. Other times, the vendors want to include different locations or make improvements. For example, with 23andMe’s focus on health, they probably add new medically related SNP locations regularly. Regardless of why, some DNA files include locations not included in other files and are not 100% compatible.

Looking at the first few entries in my example file above, let’s say that the testing vendor included the first ten positions, but an uploaded file from another company did not. Or perhaps the chip changed, and a different version of the company’s own file contains different positions.

DNA testing companies have to “fill in the blanks” for compatibility, and they do this using a technique called imputation. Illumina forced their customers to adopt imputation in 2017 when they dropped the capacity of their chip. I was initially quite skeptical, but imputation has worked surprisingly well. Some of the matching differences you will see when comparing the results of two different DNA files are a result of imputation.
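The “blanks” can be pictured as the difference between two sets of tested positions. The sketch below only illustrates finding those gaps. It is not imputation itself, which relies on statistical models and reference data. The position names and the function name are invented for the example.

```python
def coverage_gap(vendor_positions, uploaded_positions):
    """Split positions into shared, missing from the upload, and extra in the upload."""
    vendor = set(vendor_positions)
    uploaded = set(uploaded_positions)
    return {
        "shared": vendor & uploaded,
        "missing_from_upload": vendor - uploaded,  # the blanks that would need filling in
        "extra_in_upload": uploaded - vendor,      # tested elsewhere, not on the vendor's chip
    }


# Invented position names for illustration only.
vendor_chip = ["rs1", "rs2", "rs3", "rs4"]
uploaded_file = ["rs3", "rs4", "rs5"]

gap = coverage_gap(vendor_chip, uploaded_file)
print(sorted(gap["missing_from_upload"]))
```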

I wrote about imputation in an early article here. Please note the companies have fixed many issues with imputation and improved matching greatly, but the concepts and imputation processes still apply. The downloaded raw data files are your results BEFORE imputation, meaning that it’s up to any company where you upload to process your raw file in the same way they would process a file that they generated. A lot goes on behind the scenes when you upload a file to a DNA testing company.

At both 23andMe and Ancestry, you know that all of your matches tested there, meaning they did not upload a file from another testing company. You don’t know and can’t tell what chip was utilized when your matches tested. The only way to determine a chip testing version, aside from knowing the date or remembering the chip version from when you tested, is to look at the beginning of the raw data download file, although not all files contain that information.
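If you want to check the beginning of your own file, a few lines of Python will show whatever header the vendor wrote. This sketch assumes header lines begin with "#", which is an assumption about the file layout. As noted, not every file includes chip or version details, and the sample header text here is invented.

```python
import io


def header_lines(file_obj, marker="#"):
    """Collect the leading comment lines of a raw data file, stopping at the first data row."""
    header = []
    for line in file_obj:
        if not line.startswith(marker):
            break
        header.append(line.rstrip("\n"))
    return header


# Invented header text. A real file may or may not mention a chip version.
sample = io.StringIO(
    "# example export\n# array version: unknown\nrs0000001\t1\t10000\tAA\n"
)
print(header_lines(sample))
```

With a real download, you would pass an opened file instead of the sample text, for example the object returned by `open()` on your own raw data file.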

Ok, now that you understand the landscape, let’s look at my results at each company.

23andMe

I tested twice at 23andMe on two different chip versions, V3 and V4, which tested some different locations of my DNA. Neither of these chips is the current version. I originally tested twice to evaluate the differences between the two test versions which you can read about, here.

23andMe named their ethnicity results Ancestry Composition.

They last updated my V3 test’s Ancestry Composition results on July 28, 2021.

The percentages are shown at left, and the country locations are highlighted at right for my 23andMe V3 test.

The 23andMe V4 test was also updated for the last time on July 28, 2021.

The ethnicity results differ substantially between the two chip versions, even though they were both updated on the same date.

In October of 2020, in an effort to “encourage” their customers to pay for a new test on their V5 chip, 23andMe announced that there would be no ethnicity updates on older tests. So, I really don’t know for sure when my tests were actually updated. Just note how different the results are. It’s also worth mentioning that 23andMe does not show trace amounts on their map, so even though my Indigenous American results were found, they aren’t displayed on the map.

Indigenous is, however, shown in yellow on their DNA Chromosome Painting.

No other testing company restricts updates, penalizing their customers who purchased earlier versions of tests.

Matches at 23andMe

23andMe limits your matches to about 1500 unless you have purchased the current test, including health, AND pay for an annual $69 subscription, which buys you about 5000 matches. I have not purchased this test.

Your number of actual matches displayed/retained is also affected by how many people you have communicated with, or at least initiated communications with. 23andMe does not roll those people off of your match list.

I have 1803 matches on both of my tests, meaning I’ve reached out to about 300 people who would have otherwise been removed from my match list. 23andMe retains your highest matches, deleting lower matches after you reach the maximum match threshold.

I’ve randomly evaluated several of the same matches at each vendor, at least five maternal and five paternal, separated by a blank row. I wanted to determine whether they match me on the same number of centimorgans, meaning the same amount of DNA, on both tests, and the same number of segments.

Match	23andMe V3	23andMe V4
Patricia	292 cM, 12 segments	Same as V3
Joe	148 cM, 8 segments	Same
Emily	73 cM, 4 seg	72 cM, 4 seg
Roland	27 cM, 1 seg	Same
Ian	62 cM, 4 seg	Same

Stacy	469 cM, 16 segments	482 cM, 16 segments
Harold	134 cM, 6 segments	Same
Dean	69 cM, 3 seg	Same
Carl	95 cM, 4 seg	Same
Debbie	83 cM, 4 seg	84 cM, 4 seg

As you can see, the matches are either exact or close.
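To put a number on how close, the three matches in the table whose totals differ between the two tests can be compared directly. This is a small sketch using the centimorgan values from the table. The function name is hypothetical.

```python
# Total shared cM from the table above: (V3, V4). Rows marked "Same" would show 0.
matches = {
    "Emily": (73, 72),
    "Stacy": (469, 482),
    "Debbie": (83, 84),
}


def cm_difference(v3, v4):
    """Return the absolute cM difference and that difference as a percent of the V3 value."""
    diff = abs(v4 - v3)
    return diff, 100 * diff / v3


for name, (v3, v4) in matches.items():
    diff, pct = cm_difference(v3, v4)
    print(f"{name}: {diff} cM difference ({pct:.1f}%)")
```

The largest difference, Stacy’s 13 cM, is under 3% of the V3 total.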

Please note that bolded matches are also found at another company. I will include a summary table at the end comparing the same match across multiple vendors.

23andMe Summary

The 23andMe V3 and V4 match results are very close. Since the match limit is the same, and the results are so close between tests, they are essentially identical in terms of matching.

The ethnicity results are similar, but the V4 test reflects a broader region. Italian baffles me in both versions.

Ethnicity should never be taken at face value at any DNA testing company, especially with smaller percentages which could be noise or a combination of other regions which just happens to resemble Italy, in my case.

I don’t know what type of comparison the current chip would yield since I suspect it has more medical and fewer genealogical SNPs on board.

Reprocessing Tests

This is probably a good place to note that it’s very expensive for any company to update their customers’ ethnicity results because every single customer’s DNA results file must be completely rerun. Note that this does not mean their DNA itself is retested. The output raw data file is reprocessed using a new algorithm.

Rerunning means reprocessing that specific portion of every test, meaning the vendors must rent “time in the cloud.” We are talking millions of dollars for each run. I don’t know how much it costs per test, but think about the expense if it takes $1 to rerun each test in the vendor’s database. Ancestry has more than 20 million tests.
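The arithmetic is simple enough to sketch. The $1 per test is only the hypothetical figure used above, not a known cost, and 20 million is a round number standing in for the size of Ancestry’s database.

```python
def rerun_cost(tests_in_database, cost_per_test):
    """Estimated cost of reprocessing every test in the database once."""
    return tests_in_database * cost_per_test


# Hypothetical $1 per test across 20 million tests.
print(f"${rerun_cost(20_000_000, 1):,}")
```

At that hypothetical rate, a single rerun would cost $20 million.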

While we, as consumers, are always chomping at the bit for new and better ethnicity results – the testing companies need to be sure it really is “better,” not just different before they invest the money to reprocess and update results.

This is probably why 23andMe decided to cease updating older kits. The newer tests require a subscription which is recurring revenue.

The same is true when DNA testing companies need to rematch their entire user base. This happens when the criteria for matching change. For example, Ancestry purged a large number of matches for all of their customers back in 2020. While match algorithm changes necessitate rematching, with associated costs, this change also provided Ancestry with the huge benefit of eliminating approximately half of their customers’ matches. This freed up storage space, either physically in their data center or space rented in the cloud, representing substantial cost-savings.

How long can a DNA testing company reasonably be expected to continue investing in a product which never generates additional revenue but for which the maintenance and reinvestment costs never end?

Ancestry and MyHeritage both hope to offset the expenses of maintaining their customers’ DNA tests and providing free updates by selling subscriptions to their record services. 23andMe wants you to purchase a new test and a yearly subscription. FamilyTreeDNA wants you to purchase a Big Y-DNA and mitochondrial DNA test.

OK, now let’s look at my matches at Ancestry.

Ancestry

I’ve taken two Ancestry tests, V1 and V2. There were some differences, which I wrote about here and here. V2 is no longer the current chip.

Except for 23andMe, which wants its customers to purchase the most current test, the companies no longer routinely announce new chip versions. They just go about their business. The only way you know that a vendor actually changed something is when the other companies who accept uploads suddenly encounter an issue with file formats. It always takes a few weeks to sort that out.

My Ancestry V1 test’s ethnicity results don’t show my Native American ethnicity.

Ancestry results were updated in June 2022.

However, my V2 results do include Native American ethnicity.

Matches at Ancestry

I have many more matches on my V1 test at Ancestry because I took steps to preserve my smaller matches when Ancestry initiated its massive purge in 2020. I wrote about that here and here.

Ancestry’s SideView breaks matches down into maternal, paternal, and unassigned based on your side selection. You tell Ancestry which side is which. You may be able to determine which “side” is maternal or paternal either by your ethnicity or shared matches. While SideView is not always accurate, it’s a good place to begin.

Match Category	Ancestry V1 Test	Ancestry V2 Test
Maternal	15,587	15,116
Paternal	42,247	41,870
Both	2	2
Unassigned	48,999	4,127
Total	106,835	61,115

Ancestry either displays all your matches or your matches by side, which I used to compile the table above. I suspect that Ancestry is not assigning any of the smaller preserved matches to “sides” based on the numbers above.
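The table’s numbers can be checked with a few lines of Python. This sketch confirms that the categories add up to each stated total and shows how much of the gap between the two tests sits in the Unassigned category. The figures come from the table above, and the variable names are only illustrative.

```python
# Match counts from the SideView table above.
counts = {
    "V1": {"Maternal": 15_587, "Paternal": 42_247, "Both": 2, "Unassigned": 48_999},
    "V2": {"Maternal": 15_116, "Paternal": 41_870, "Both": 2, "Unassigned": 4_127},
}
stated_totals = {"V1": 106_835, "V2": 61_115}

for test, by_side in counts.items():
    total = sum(by_side.values())
    assert total == stated_totals[test]  # categories add up to the stated total
    share = 100 * by_side["Unassigned"] / total
    print(f"{test}: {total:,} matches, {share:.1f}% unassigned")

total_gap = stated_totals["V1"] - stated_totals["V2"]
unassigned_gap = counts["V1"]["Unassigned"] - counts["V2"]["Unassigned"]
print(f"{unassigned_gap:,} of the {total_gap:,} extra V1 matches are unassigned")
```

Nearly all of the difference between the two tests falls in the Unassigned category, which is consistent with the preserved smaller matches not being assigned to a side.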

Ancestry implemented a process called Timber that removes DNA that they feel is “too matchy,” meaning you match enough people in this region that they think it’s a pileup region for you personally, and therefore not useful. In some cases, enough DNA is removed that the person is no longer considered a match because they fall beneath the match threshold. I am not a fan of Timber.

Your match amount shown is AFTER Timber has removed those segments. Unweighted shared DNA is your pre-Timber match amount.

You can view the Unweighted shared DNA by clicking on the amount of shared DNA on your match list.

You can read Ancestry’s Matching White Paper, here.

Let’s take a look at my matches. I’ve listed both weighted and unweighted where they are different.

Match	Ancestry V1	Ancestry V2
Michael	755 cM, 35 seg	737 cM, 33 seg
Edward	66 cM, 4 seg (unweighted 86 cM)	65 cM, 4 seg (unweighted 86 cM)
Tom	59 cM, 3 seg (unweighted 63)	Same
Jonathon	43 cM, 4 seg, (unweighted 52 cM)	Same
Matthew	20 cM, 2 seg (unweighted 35 cM)	Same

Harold	132 cM, 7 seg	135 cM, 6 seg
Dean	67 cM, 4 seg (unweighted 78 cM)	66 cM, 4 seg (unweighted 78 cM)
Debbie	93 cM, 5 seg	Same
Valli	142 cM, 3 seg	Same
Jared	20 cM, 1 seg (unweighted 22 cM)	Same

Timber only removes DNA when the match is under 90 cM. Almost every match under 90 cM has some DNA removed.

Ancestry Summary

The results of the two Ancestry tests are very close.

In some circumstances, no DNA is removed by Timber, so the unweighted is the same as the weighted. However, in other cases, a significant amount is removed. 15 cM of Matthew’s 35 cM was removed by Timber, reducing his total to 20 cM.
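The amount Timber removed is simply the unweighted total minus the weighted total. Here is a short sketch using the V1 figures from the match table above, for the matches that list both values. The function name is hypothetical.

```python
# (weighted cM, unweighted cM) for the V1 matches in the table that list both values.
v1_matches = {
    "Edward": (66, 86),
    "Tom": (59, 63),
    "Jonathon": (43, 52),
    "Matthew": (20, 35),
    "Dean": (67, 78),
    "Jared": (20, 22),
}


def timber_removed(weighted, unweighted):
    """Return the cM removed and the percent of the unweighted total that it represents."""
    removed = unweighted - weighted
    return removed, 100 * removed / unweighted


for name, (weighted, unweighted) in v1_matches.items():
    removed, pct = timber_removed(weighted, unweighted)
    print(f"{name}: {removed} cM removed ({pct:.0f}% of unweighted)")
```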

Remember that Ancestry does not show shared matches unless they are greater than 20 cM, which is different than any other DNA testing company.

At one point, Ancestry was selling a health test that was also a genealogy test. That test utilized a different chip that is not accepted for uploads by other vendors. The results of that test might well be different from the “normal” Ancestry tests focused on genealogy. The Ancestry health test is no longer offered.

Companies that Accept Uploads

DNA testing companies that accept uploaded DNA files from other DNA testing companies need to process the uploaded file, just like a file that is generated in their own lab. Of course, they must deal with the differences between uploaded files and their own file format. The processing includes imputation and formats the uploaded file so that it works with the tools that they provide for their customers, including ethnicity (by whatever name they use), matching, family matching (bucketing), advanced matching, the match matrix, triangulation, AutoClusters, Theories of Family Relativity, and other advanced tools.

Of course, the testing company accepting uploads can only work with the DNA locations provided by the original DNA testing company in the uploaded file.

Matching and some additional tools are free to uploaders, but advanced tools require an inexpensive unlock.

FamilyTreeDNA

I took a test at FamilyTreeDNA, plus uploaded a copy of both of my Ancestry DNA files.

FamilyTreeDNA named their population (ethnicity) test myOrigins and the current version is V3. I wrote about the rollout and comparison in September of 2020, here.

My DNA test taken at FamilyTreeDNA, above, reveals Native American segments that match reference populations found both in North and South America and the Caribbean Islands.

At FamilyTreeDNA, my Ancestry V1 uploaded file results show Native American population matches only in North America.

Interestingly, my Ancestry V1 file processed AT Ancestry did not reveal Native American ancestry, but the same file uploaded to and processed at FamilyTreeDNA did show Native American results, reflecting the difference between the vendors’ internal algorithms and reference populations utilized.

My myOrigins results from my Ancestry V2 uploaded file at FamilyTreeDNA also include my North American Native American segments. The V2 test also showed Native American ethnicity at Ancestry, so clearly something changed in Ancestry’s algorithm, locations tested, and/or reference populations between V1 and V2.

Fortunately, FamilyTreeDNA provides both chromosome painting and a population download file so I can match those Native segments with my autosomal matches to identify which of my ancestors contributed those specific segments.

One of my Native segments is shown in pink on chromosome 1. My mother has a Native segment in exactly the same location, so I know that this segment originated with my mother’s ancestors.

I downloaded the myOrigins population segment file and painted my results at DNAPainter, along with the matches where I can identify our common ancestor. This allowed me to pinpoint the ancestral line that contributed this Native segment in my maternal line. You can read about using DNAPainter, here.

FamilyTreeDNA Matches

I have significantly more matches at FamilyTreeDNA on their test than on either of my Ancestry tests that I uploaded. However, nearly the same number are maternally or paternally assigned through Family Matching, with the remainder unassigned. You can read about Family Matching here.

Match Category	FamilyTreeDNA Test	Ancestry V1 at FamilyTreeDNA	Ancestry V2 at FamilyTreeDNA
Paternal	3,479	3,572	3,422
Maternal	1,549	1,536	1,477
Both	3	3	3
All	8,154	6,397	6,579

Family matching, aka bucketing, automatically assigns my matches as maternal and paternal by linking known relatives to their place in my tree.

I completed the following match chart using my original test taken at FamilyTreeDNA, plus the same match at FamilyTreeDNA for both of my Ancestry tests.

In other words, Cheryl matched me at 467 cM on 21 segments on the original test taken at FamilyTreeDNA. She matched me on 473 cM and 21 segments on my Ancestry V1 test uploaded to FamilyTreeDNA and on 483 cM and 22 segments on the Ancestry V2 test uploaded to FamilyTreeDNA.

Match	FamilyTreeDNA	Ancestry V1 at FTDNA	Ancestry V2 at FTDNA
Cheryl	467 cM, 21 seg	473 cM, 21 seg	483 cM, 22 seg
Patricia	195 cM, 11 seg	189 cM, 11 seg	188 cM, 11 seg
Tom	77 cM, 4 seg	71 cM, 4 seg	76 cM, 4 seg
Thomas	72 cM, 3 seg	71 cM, 3 seg	74 cM, 3 seg
Roland	29 cM, 1 seg	35 cM, 2 seg	35 cM, 2 seg
Rex	62 cM, 4 seg	55 cM, 3 seg	57 cM, 3 seg
Don	395 cM, 18 seg	362 cM, 15 seg	398 cM, 18 seg
Ian	64 cM, 4 seg	56 cM, 4 seg	64 cM, 4 seg

Stacy	490 cM, 18 seg	494 cM, 15 seg	489 cM, 14 seg
Harold	127 cM, 5 seg	133 cM, 6 seg	143 cM, 6 seg
Dean	81 cM, 4 seg	75 cM, 3 seg	83 cM, 4 seg
Carl	103 cM, 4 seg	101 cM, 4 seg	102 cM, 4 seg
Debbie	99 cM, 5 seg	97 cM, 5 seg	99 cM, 5 seg
David	373 cM, 16 seg	435 cM, 19 seg	417 cM, 18 seg
Amos	176 cM, 7 seg	177 cM, 8 seg	177 cM, 7 seg
Buster	387 cM, 15 seg	396 cM, 16 seg	402 cM, 17 seg
Charlene	461 cM, 21 seg	450 cM, 21 seg	448 cM, 20 seg
Carol	65 cM, 6 seg	64 cM, 6 seg	65 cM, 6 seg

I have tested many of my cousins at FamilyTreeDNA and encouraged others to test or upload. I’ve attempted to include enough people so that I can have common matches at least at one other DNA testing company for comparison.

FamilyTreeDNA Summary

The matches are relatively close, with a few being exact.
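One way to put a number on “relatively close” is to compare the largest and smallest cM total for each match. The sketch below uses a handful of rows copied from the match table. The `spread` helper is a made-up name for illustration, and measuring the difference against the largest value is just one reasonable choice.

```python
# cM totals copied from the match table above, in the order:
# FamilyTreeDNA test, Ancestry V1 upload, Ancestry V2 upload.
matches = {
    "Cheryl": (467, 473, 483),
    "Patricia": (195, 189, 188),
    "Don": (395, 362, 398),
    "David": (373, 435, 417),
    "Carol": (65, 64, 65),
}


def spread(cms):
    """Return (difference in cM, difference as a % of the largest value)."""
    diff = max(cms) - min(cms)
    return diff, round(100 * diff / max(cms), 1)


for name, cms in matches.items():
    diff, pct = spread(cms)
    print(f"{name}: {diff} cM spread ({pct}%)")
```

Most of these rows differ by only a few percent, while David and Don show the larger swings.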

Interestingly, some of the segment counts are different. In most cases, this results from one segment being broken into multiple segments by one or more of the tests, but not always. In the couple that I checked, the entire segment seems to descend from the same ancestral couple, so the break is likely a result of not all of the same DNA locations being tested, plus the limits of imputation.

MyHeritage

I have two tests at MyHeritage. One taken at MyHeritage, and an uploaded file from FamilyTreeDNA.

MyHeritage displays both ethnicity results and Genetic Groups, which map groups of people that you match. I left the Genetic Groups setting at the highest confidence level. Shifting it to a lower level displays additional Genetic Groups, some of which overlap with or are within ethnicity regions.

My test taken at MyHeritage, above, shows several ethnicities and Genetic Groups, but no Native American.

My FamilyTreeDNA kit processed at MyHeritage shows the same ethnicity regions, one additional Genetic Group, plus Native American heritage in the Amazon which is rather surprising given that I don’t show Native in North American regions where I’m positive my Native ancestors lived.

MyHeritage Matching

At MyHeritage, I compared the results of the test I took with MyHeritage, and a test I uploaded from FamilyTreeDNA. Fewer than half of my matches can be assigned to a parent via shared matching.

Matches	MyHeritage Test	FamilyTreeDNA at MyHeritage
Paternal	4,422	6,501
Maternal	2,660	3,655
Total	13,233	16,147

I have rounded my matches at MyHeritage to the closest cM.

Match	MyHeritage Test	FamilyTreeDNA at MyHeritage
Michael	801 cM, 32 seg	823 cM, 31 seg
Cheryl	467 cM, 23 seg	477 cM, 23 seg
Roland	No match	28 cM, 1 seg
Patty	156 cM, 9 seg	151 cM, 9 seg
Rex	43 cM, 4 seg	53 cM, 3 seg
Don	369 cM, 16 seg	382 cM, 17 seg

David	449 cM, 17 seg	460 cM, 17 seg
Charlene	454 cM, 23 seg	477 cM, 24 seg
Buster	408 cM, 15 seg	410 cM, 16 seg
Amos	183 cM, 8 seg	Same
Carol	78 cM, 6 seg	87 cM, 7 seg

MyHeritage Summary

I was surprised to discover that Roland had no match with the MyHeritage test, but did with the FamilyTreeDNA test. I wonder if this is a searching or matching glitch, especially since both companies use the same chip. 28 cM in one segment is a reasonably large match, and even if it was divided in two, it would still be over the matching threshold. I know this is a valid match because Roland triangulates with me and several cousins, I’m positive of our common ancestor, and he also matches me at both FamilyTreeDNA and 23andMe.

Other than that, the matches are reasonably close, with one being exact.

Your Matches Aren’t Everyplace

I unsuccessfully searched for someone who was a match to me in all four databases. Ancestry does not permit match downloads, so I had to search manually. People don’t always use the same names in different databases.

Surprisingly, I was unable to find one match who is in all of the databases. Many people only suggest testing at Ancestry because they have the largest database, but if you look at the following comparison chart that I’ve created, you’ll see that 16 of 26 people, or 62% were not at Ancestry. Conversely, many people were at Ancestry and not elsewhere. I could not find five maternal and five paternal matches at Ancestry that I could identify as matches in another database. 40% were not elsewhere.

If you think for one minute that it doesn’t matter for genealogy if you’re in all four major databases, please reconsider. It surely does matter.

Every single vendor has matches that the others don’t. Substantial, important matches. I have found first and second-cousin matches in every database that weren’t elsewhere.

Many of the original testers have passed away and can’t test again. My mother can never test at either 23andMe or Ancestry, but she is at both FamilyTreeDNA and MyHeritage because I could upgrade her kit at FamilyTreeDNA after she died. I uploaded her to MyHeritage. Of course, because she is a generation closer to our ancestors, she has many valuable matches that I don’t.

Each vendor provides either an email address or a messaging platform for you to contact your matches. Don’t be discouraged if they don’t answer. Just today, I received a reply that was years in the making.

Genealogists hope for immediate gratification, but we are actually in this for the long game. Play it with every tool at your disposal.

The Answer

Does it matter if you test at a DNA testing company, or upload a file?

I know this was a very long answer to what my readers hoped was a simple yes or no question.

There is no consistent answer at either FamilyTreeDNA or MyHeritage, the two DNA testing companies that accept uploads. Be sure you’re in both databases. My closest two matches that I did not test were found at MyHeritage. Here’s a direct link to upload at MyHeritage.

Of the vendors, those two should be the closest to each other because they are both processed in the Gene by Gene lab, but again, the actual chip version, when the test was originally taken, and each vendor’s internal processing will result in differences. Neither the original test at the DNA testing company nor the uploaded files have consistently higher or lower matches. Neither type of test or upload appears to be universally more or less accurate. Differences in either direction seem to occur on a match-by-match basis. Many are so close as to be virtually equivalent, with a few seemingly random exceptions. Of course, we always have to consider Timber.

If you upload, unlock the advanced features at both FamilyTreeDNA and MyHeritage.

If you upload to a DNA testing company, you may discover in the future that some features and functions will only be available to original testers.

Personally, if I had the option, I would test at the company directly simply because it eliminates or at least reduces the possibility of future incompatibilities – with the exception of 23andMe which has chosen to not provide consistent updates to older tests. I’m incredibly grateful I didn’t test my mother or now deceased family members at 23andMe, and only there. I would be heartsick, heartbroken, and furious.

Our DNA is an extremely valuable resource for our genealogy. It’s the gift that truly keeps on giving, day after day, even when other records don’t exist. Be sure you and your family members are in each database one way or another, and test your Y-DNA (for males) and mitochondrial DNA (for everyone) to have a complete arsenal at your disposal.


DNA: In Search of…Signs of Endogamy

Posted on August 11, 2022 by Roberta Estes

This is the fourth in our series of articles about searching for unknown close family members, specifically parents, grandparents, or siblings. However, these same techniques can be applied by genealogists to ancestors further back in time as well.

I introduced the “In Search of” series in the article, DNA: In Search of…New Series Launches.
In the second article, DNA: In Search of…What Do You Mean I’m Not Related to My Family? – and What Comes Next? we discussed the discovery that something was amiss when you don’t match a family member that you expect to match, then how to make sure a vial or upload mix-up didn’t happen. Next, I covered the basics of the four kinds of DNA tests you’ll be able to use to solve your mystery.
In the third article, In Search of…Vendor Features, Strengths, and Testing Strategies, we discussed testing goals and strategies, including testing with and uploading to multiple autosomal DNA vendors, Y DNA, and mitochondrial DNA testing. We reviewed the vendors’ strengths and the benefits of combining vendor information and resources.

In this article, we discuss endogamy – how to determine if you have it, from what population, and how to follow the road signs.

After introductions, we will be covering the following topics:

Pedigree collapse and endogamy
Endogamous groups
The challenge(s) of endogamy
Endogamy and unknown close relatives (parents, grandparents)
Ethnicity and Populations
Matches
AutoClusters
Endogamous Relationships
Endogamous DNA Segments
“Are Your Parents Related?” Tool
Surnames
Projects
Locations
Y DNA, Mitochondrial DNA, and Endogamy
Endogamy Tools Summary Tables
- Summary of Endogamy Tools by Vendor
- Summary of Endogamous Populations Identified by Each Tool
- Summary of Tools to Assist People Seeking Unknown Parents and Grandparents

What Is Endogamy and Why Does It Matter?

Endogamy occurs when a group or population of people intermarry among themselves for an extended period of time, without the introduction of many or any people from outside of that population.

The effect of this continual intermarriage is that the founders’ DNA simply gets passed around and around, eventually in small segments.

That happens because there is no “other” DNA to draw from within the population. Knowing or determining that you have endogamy helps make sense of DNA matching patterns, and those patterns can lead you to unknown relatives, both close and distant.

This Article

This article serves two purposes.

This article is educational and relevant for all researchers. We discuss endogamy using multiple tools and examples from known endogamous people and populations.
In order to be able to discern endogamy when we don’t know who our parents or grandparents are, we need to know what signs and signals to look for, and why, which is based on what endogamy looks like in people who know their heritage.

There’s no crystal ball – no definitive “one-way” arrow, but there are a series of indications that suggest endogamy.

Depending on the endogamous population you’re dealing with, those signs aren’t always the same.

If you’re sighing now, I understand – but that’s exactly WHY I wrote this article.

We’re covering a lot of ground, but these road markers are invaluable diagnostic tools.

I’ve previously written about endogamy in the articles:

What’s the Difference Between Pedigree Collapse and Endogamy?
Endogamy and DNA Segments
The Faces of Endogamy – The images in this 2017 article are somewhat out of date, but the concepts are as valid now as they were when I wrote the article.

Let’s start with definitions.

Pedigree Collapse and Endogamy

Pedigree collapse isn’t the same as endogamy. Pedigree collapse is when you have ancestors that repeat in your tree.

In this example, the parents of our DNA tester are first cousins, which means the tester shares great-grandparents on both sides and, of course, the same ancestors from there on back in their tree.

This also means they share more of those ancestors’ DNA than they would normally share.

John Smith and Mary Johnson are both in the tree twice, in the same position as great-grandparents. Normally, Tester Smith would carry approximately 12.5% of each of his great-grandparents’ DNA, assuming for illustration purposes that exactly 50% of each ancestor’s DNA is passed in each generation. In this case, due to pedigree collapse, 25% of Tester Smith’s DNA descends from John Smith, and another 25% descends from Mary Johnson, double what it would normally be. 25% is the amount of DNA contribution normally inherited from grandparents, not great-grandparents.
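The arithmetic can be sketched in a few lines of Python. This uses the same simplifying assumption as above, that exactly 50% of each ancestor’s DNA is passed in each generation, and the function name is made up for illustration.

```python
def expected_share(generations_back, times_in_tree=1):
    """Expected fraction of a tester's DNA that comes from one ancestor.

    Assumes exactly half of an ancestor's DNA is passed down in every
    generation, which is an illustration only. Real inheritance varies.
    """
    return times_in_tree * 0.5 ** generations_back


# Great-grandparents are three generations back, grandparents two.
normal = expected_share(3)                      # one place in the tree
collapsed = expected_share(3, times_in_tree=2)  # same ancestor in two places
grandparent = expected_share(2)

print(f"{normal:.1%} {collapsed:.1%} {grandparent:.1%}")
```

A great-grandparent who appears twice contributes the same expected amount as a grandparent who appears once.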

While we may find first cousin marriages a bit eyebrow-raising today, they were quite common in the past. Both laws and customs varied with the country, time, social norms, and religion.

Pedigree Collapse and Endogamy are NOT the Same

You might think that pedigree collapse and endogamy are one and the same, but there’s a difference. Pedigree collapse can lead to endogamy, but it takes more than one instance of pedigree collapse to morph into endogamy within a population. Population is the key word for endogamy.

The main difference is that pedigree collapse occurs with known ancestors in more recent generations for one person, while endogamy is longer-term and systemic in a group of people.

Picture a group of people, all descended from Tester Smith’s great-grandparents intermarrying. Now you have the beginnings of endogamy. A couple hundred or a few hundred years later, you have true endogamy.

In other words, endogamy is pedigree collapse on a larger scale – think of a village or a church.

My ancestors’ village of Schnait, in Germany, is shown above in 1685. One church and maybe 30 or 40 homes. According to church and other records, the same families had inhabited this village, and region, for generations. It’s a sure bet that both pedigree collapse and endogamy existed in this small community.

If pedigree collapse happens over and over again because there are no other people within the community to marry, then you have endogamy. In other words, with endogamy, you assuredly DO have historical pedigree collapse, generally back in time, often before you can identify those specific ancestors – because everyone descends from the same set of founders.

Endogamy Doesn’t Necessarily Indicate Recent Pedigree Collapse

With deep, historic endogamy, you don’t necessarily have recent pedigree collapse, and in fact, many people do not. Jewish people are a good example of this phenomenon. They shared ancestors for hundreds or thousands of years, depending on which group we are referring to, but in recent, known, generations, many Jewish people aren’t related. Still, their DNA often matches each other.

The good news is that there are telltale signs and signals of endogamy.

The bad news is that not all of these are obvious or useful as an aid to people seeking clues about unknown close relatives, and other “signs” aren’t what they are believed to be.

Let’s step through each endogamy identifier, or “hint,” and then we will review how we can best utilize this information.

First, let’s take a look at groups that are considered to be endogamous.

Endogamous Groups

Jewish People – Specifically groups that were isolated from other groups of Jewish (and other) people; Ashkenazi (Germany, Northern France, and diaspora), Sephardic (Spanish, Iberia, and diaspora), Mizrahi (Israel, Middle Eastern, and diaspora,) Ethiopian Jews, and possibly Jews from other locations such as Mountain Jews from Kazakhstan and the Caucasus.

Acadians – Descendants of about 60 French families who settled in “Acadia” beginning about 1604, primarily in what is now Nova Scotia, and intermarried among themselves and with the Mi’kmaq people. Expelled by the English in 1755, they were scattered in groups to various diasporic regions where they continued to intermarry and where their descendants are found today. Some Acadians became the Cajuns of Louisiana.

Anabaptist Protestant Faiths – Amish, Mennonite, and Brethren (Dunkards) and their offshoots are Protestant religious sects founded in Europe in the 14th, 15th, and 16th centuries on the principle of baptizing only adults or people who are old enough to choose to follow the faith, or rebaptizing people who had been previously baptized as children. These Anabaptist faiths tend to marry within their own group or church and often expel those who marry outside of the faith. Many emigrated to the American colonies and elsewhere, seeking religious freedom. Occasionally those groups would locate in close proximity and intermarry, but not marry outside of other Anabaptist denominations.

Native American (Indigenous) People – all indigenous peoples found in North and South America before European colonization descended from a small number of original founders who probably arrived at multiple times.

Indigenous Pacific Islanders – Including indigenous peoples of Australia, New Zealand, and Hawaii prior to colonization. They are probably as endogamous as Native American people, but I don’t have specific examples to share.

Villages – European or other villages with little inflow or whose residents were restricted from leaving over hundreds of years.

Other groups may have significant multiple lines of pedigree collapse and therefore become endogamous over time. Some people from Newfoundland, French Canadians, and Mormons (Church of Jesus Christ of Latter-Day Saints) come to mind.

Endogamy is a process that occurs over time.

Endogamy and Unknown Relatives

If you know who your relatives are, you may already know you’re from an endogamous population, but if you’re searching for close relatives, it’s helpful to be able to determine if you have endogamous heritage, at least in recent generations.

If you know nothing about either parent, some of these tools won’t help you, at least not initially, but others will. However, as you add to your knowledge base, the other tools will become more useful.

If you know the identity of one parent, this process becomes at least somewhat easier.

In future articles, we will search specifically for parents and each of your four grandparents. In this article, I’ll review each of the diagnostic tools and techniques you can use to determine if you have endogamy, and perhaps pinpoint the source.

The Challenge

People with endogamous heritage are related in multiple, unknown ways, over many generations. They may also be related in known ways in recent generations.

If both of your parents share the SAME endogamous culture or group of relatives:

You may have significantly more autosomal DNA matches than people without endogamy, unless that group of people is under-sampled. Jewish people have significantly more matches, but Native people have fewer due to under-sampling.
You may experience a higher-than-normal cM (centiMorgan) total for estimated relationships, especially more distant relationships, 3C and beyond.
You will have many matches related to you on both your maternal and paternal sides.
Parts of your autosomal DNA will be the same on both your mother’s and father’s sides, meaning your DNA will be fully identical in some locations. (I’ll explain more in a minute.)

If either (or both) of your parents are from an endogamous population, you:

Will, in some cases, carry identifying Y and mitochondrial DNA that points to a specific endogamous group. This is true for Native people, can be true for Jewish people and Pacific Islanders, but is not true for Anabaptist people.

One Size Does NOT Fit All

Please note that there is no “one size fits all.”

Each or any of these tools may provide relevant hints, depending on:

Your heritage
How many other people have tested from the relevant population group
How many close or distant relatives have tested
If your parents share the same heritage
Your unique DNA inheritance pattern
If your parents, individually, were fully endogamous or only partly endogamous, and how far back generationally that endogamy occurred

For example, in my own genealogy, my maternal grandmother’s father was Acadian on his father’s side. While I’m not fully endogamous, I have significantly more matches through that line proportionally than on my other lines.

I have Brethren endogamy on my mother’s side via her paternal grandmother.

Endogamous ancestors are shown with red stars on my mother’s pedigree chart, above. However, please note that her maternal and paternal endogamous ancestors are not from the same endogamous population.

However, I STILL have fewer matches on my mother’s side in total than on my father’s side because my mother has recent Dutch and recent German immigrant ancestors, which reduces her total number of matches. Neither of those lines has had as much time to produce descendants in the US, and Europe is under-sampled when compared with the US, where more people tend to take DNA tests because they are searching for where they came from.

My father’s ancestors have been in the US since it was a British Colony, and I have many more cousins who have tested on his side than on my mother’s.

If you looked at my pedigree chart and thought to yourself, “that’s messy,” you’d be right.

The “endogamy means more matches” axiom does not hold true for me, comparatively, between my parents – in part because my mother’s German and Dutch lines are such recent immigrants.

The number of matches alone isn’t going to tell this story.

We are going to need to look at several pieces and parts for more information. Let’s start with ethnicity.

Ethnicity and Populations

Ethnicity can be a double-edged sword. It can tell you exactly nothing you couldn’t discern by looking in the mirror, or, conversely, it can be a wealth of information.

Ethnicity reveals the parts of the world where your ancestors originated. When searching for recent ancestors, you’re most interested in majority ethnicity, meaning the 50% of your DNA that you received from each of your parents.

Ethnicity results at each vendor are easy to find and relatively easy to understand.

This individual at FamilyTreeDNA is 100% Ashkenazi Jewish.

If they were 50% Jewish, we could then estimate, and that’s an important word, that either one of their parents was fully Jewish, and not the other, or that two of their grandparents were Jewish, although not necessarily on the same side.

On the other hand, my mother’s ethnicity, shown below, has nothing remarkable that would point to any majority endogamous population, yet she has two.

The only hint of endogamy from ethnicity would be her ~1% Americas, and that isn’t relevant for finding close relatives. However, minority ancestry is very relevant for identifying Native ancestors, which I wrote about, here.

You can correlate or track your ethnicity segments to specific ancestors, which I discussed in the article, Native American & Minority Ancestors Identified Using DNAPainter Plus Ethnicity Segments, here.

Since I wrote that article, FamilyTreeDNA has added the feature of ethnicity or population Chromosome Painting, based on where each of your populations fall on your chromosomes.

In this example on chromosome 1, I have European ancestry (blue,) except for the pink Native segment, which occurs in the same location on my mother’s chromosome 1 as well.

Both 23andMe and FamilyTreeDNA provide chromosome painting AND the associated segment information so you can identify the relevant ancestors.

Ancestry is in the process of rolling out an ethnicity painting feature, BUT, it has no segment or associated matching information. While it’s interesting eye candy, it’s not terribly useful beyond the ethnicity information that Ancestry already provides. However, Jonny Perl at DNAPainter has devised a way to estimate Ancestry’s start and stop locations, here. Way to go Jonny!

Now all you need to do is convince your Ancestry matches to upload their DNA file to one of the three databases, FamilyTreeDNA, MyHeritage, and GEDMatch, that accept transfers, aka uploads. This allows matching with segment data so that you can identify who matches you on that segment, track your ancestors, and paint your ancestral segments at DNAPainter.

I provided step-by-step instructions, here, for downloading your raw DNA file from each vendor in order to upload the file to another vendor.

Ethnicity Sides

Three of the four DNA testing vendors, 23andMe, FamilyTreeDNA, and recently, Ancestry, attempt to phase your ethnicity DNA, meaning to assign it to one parental “side” or the other – both in total and on each chromosome.

Here’s Ancestry’s SideView, where your DNA is estimated to belong to parent 1 and parent 2. I detailed how to determine which side is which, here, and while that article was written specifically pertaining to Ancestry’s SideView, the technique is relevant for all the vendors who attempt to divide your DNA into parents, a technique known as phasing.

I say “attempt” because phasing may or may not be accurate, meaning the top chromosome may not always be parent 1, and the bottom chromosome may not always be parent 2.

Here’s an example at 23andMe.

See the two yellow segments. They are both assigned as Native. I happen to know one is from the mother and one is from the father, yet they are both displayed on the “top” chromosome, which one would interpret to be the same parent.

I am absolutely positive this is not the case because this is a close family member, and I have the DNA of the parent who contributed the Native segment on chromosome 1, on the top chromosome. That parent does not have a Native segment on chromosome 2 to contribute. So that Native segment had to be contributed by the other parent, but it’s also shown on the top chromosome.

The DNA segments circled in purple belong together on the same “side” and were contributed to the tester by the same parent. The Native segment on chromosome 2 abuts a purple African segment, suggesting perhaps that the ancestor who contributed that segment was mixed between those ethnicities. In the US, that suggests enslavement.

The other African segments, circled, are shown on the second chromosome in each pair.

To be clear, parent 1 is not assigned by the vendors to either mother or father and will differ by person. Your parent 1, or the parent on the top chromosome may be your mother and another person’s parent 1 may be their father.

As shown in this example, parents can vary by chromosome, a phenomenon known as “strand swap.” Occasionally, the DNA can even be swapped within a chromosome assignment.

You can, however, get an idea of the division of your DNA at any specific location. As shown above, you can only have a maximum of two populations of DNA on any one chromosome location.

In our example above, this person’s majority ancestry is European (blue.) On each chromosome where we find a minority segment, the opposite chromosome in the same location is European, meaning blue.

Let’s look at another example.

At FamilyTreeDNA, the person whose ethnicity painting is shown below has a Native American (pink) ancestor on their father’s side. FamilyTreeDNA has correctly phased or identified their Native segments as all belonging to the second chromosome in each pair.

Looking at chromosome 18, for example, most of their father’s chromosome is Native American (pink). The other parent’s chromosome is European (dark blue) at those same locations.

If one of the parents was of one ethnicity, and the other parent was of a completely different ethnicity, then one bar of each chromosome would be all pink, for example, and one would be entirely blue, representing the other ethnicity.

Phasing ethnicity or populations to maternal and paternal sides is not foolproof, and each chromosome is phased individually.

Ethnicity can, in some cases, give you a really good idea of what you’re dealing with in terms of heritage and endogamy.

If someone had an Ashkenazi Jewish father and European mother, for example, one copy of each chromosome would be yellow (Ashkenazi Jewish), and one would be blue (European.)

However, if each of their parents were half European Jewish and half European (not Jewish), then their different colored segments would be scattered across their entire set of chromosomes.

In this case, both of the tester’s parents are mixed – European Jewish (green) and Western Europe (blue.) We know both parents are admixed from the same two populations because in some locations, both parents contributed blue (Western Europe), and in other locations, both contributed Jewish (green) segments.

Both MyHeritage and Ancestry provide a secondary tool that’s connected to ethnicity, but different and generally in more recent times.

Ancestry’s DNA Communities

While your ethnicity may not point to anything terribly exciting in terms of endogamy, DNA Communities might. Ancestry says that a DNA Community is a group of people who share DNA because their relatives recently lived in the same place at the same time, and that communities are much smaller than ethnicity regions and reach back only about 50-300 years.

Based on the ancestors’ locations in the trees of me and my matches, Ancestry has determined that I’m connected to two communities. In my case, the blue group is clearly my father’s line. The orange group could be either parent, or even a combination of both.

My endogamous Brethren could be showing up in Maryland, Pennsylvania, and Ohio, but it’s uncertain, in part, because my father’s ancestral lines are found in Virginia, West Virginia, and Maryland too.

These aren’t useful for me, but they may be more useful for fully endogamous people, especially in conjunction with ethnicity.

My Acadian cousin’s European ethnicity isn’t informative.

However, viewing his DNA Communities puts his French heritage into perspective, especially combined with his match surnames.

I wrote about DNA Communities when it was introduced with the name Genetic Communities, here.

MyHeritage’s Genetic Groups

MyHeritage also provides a similar feature that shows where my matches’ ancestors lived in the same locations as mine.

One difference, though, is that testers can adjust their ethnicity results confidence level from high, above, to low, below, where one of my Genetic Groups overlaps my ethnicity in the Netherlands.

You can also sort your matches by Genetic Groups.

The results show you not only who is in the group, but how many of your matches are in that group too, which provides perspective.

I wrote about Genetic Groups, here.

Next, let’s look at how endogamy affects your matches.

Matches

The number of matches may be quite different for a person from an entirely endogamous community than for a person with no endogamy.

FamilyTreeDNA provides a Family Matching feature that triangulates your matches and assigns them to your paternal or maternal side by using known matches that you have linked to their profile cards in your tree. You must link people for the Family Matching feature known as “bucketing” to be enabled.

The people you link are then processed for shared matches on the same chromosome segment(s). Triangulated individuals are then deposited in your maternal, paternal, and both buckets.
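To make the bucketing idea concrete, here is a minimal Python sketch. It is not FamilyTreeDNA’s actual algorithm. The segment format, the 7 cM overlap cutoff, and every name in it are assumptions made for illustration. It is also simplified: it only checks whether a match overlaps a segment you share with a linked relative, while true triangulation also requires that the match and the linked relative match each other on that segment.

```python
from typing import NamedTuple


class Segment(NamedTuple):
    chromosome: int
    start: float  # start position in cM (illustrative units)
    end: float    # end position in cM


def overlap_cm(a: Segment, b: Segment) -> float:
    """Length of the overlap between two segments on the same chromosome."""
    if a.chromosome != b.chromosome:
        return 0.0
    return max(0.0, min(a.end, b.end) - max(a.start, b.start))


def bucket(match_segments, linked, min_overlap=7.0):
    """Assign a match to 'maternal', 'paternal', 'both', or 'unassigned'.

    linked maps a side ('maternal' or 'paternal') to the segments the tester
    shares with relatives linked on that side. min_overlap is an assumed cutoff.
    """
    sides = set()
    for side, relative_segments in linked.items():
        if any(
            overlap_cm(m, r) >= min_overlap
            for m in match_segments
            for r in relative_segments
        ):
            sides.add(side)
    if sides == {"maternal", "paternal"}:
        return "both"
    return next(iter(sides)) if sides else "unassigned"


linked = {
    "maternal": [Segment(1, 10, 40)],
    "paternal": [Segment(5, 0, 30)],
}
print(bucket([Segment(1, 20, 35)], linked))
print(bucket([Segment(1, 20, 35), Segment(5, 5, 20)], linked))
```

In an endogamous family, many matches overlap linked relatives on both sides, which is what fills the “Both” bucket.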

Obviously, your two parents are the best people to link, but if they haven’t tested (or uploaded their DNA file from another vendor) and you have other known relatives, link them using the Family Tree tab at the top of your personal page.

I uploaded my Ancestry V4 kit to use as an example for linking. Let’s pretend that’s my sister. If I had not already linked my Ancestry V4 kit to “my sister’s” profile card, I’d want to do that and link other known individuals the same way. Just drag and drop the match to the correct profile card.

Note that a full or half sibling will be listed as such at FamilyTreeDNA, but an identical twin will show as a potential parent/child match to you. You’re much more likely to find a parent than an identical twin, but just be aware.

I’ve created a table of FamilyTreeDNA bucketed match results, by category, comparing the number of matches in endogamous categories with non-endogamous.

	Total Matches	Maternal Matches	Paternal Matches	Both	% Both	% DNA Unassigned
100% Jewish	34,637	11,329	10,416	4,806	13.9	23.3
100% Jewish	32,973	10,700	9,858	4,606	14	23.7
100% Jewish	32,255	9,060	10,970	3,892	12	25.8
75% Jewish	24,232	11,846	Only mother linked	Only mother linked	Only mother linked
100% Acadian	8,093	3,826	2,299	1,062	13	11
100% Acadian	7,828	3,763	1,825	923	11.8	17
Not Endogamous	6,760	3,845	1,909	13	0.19	14.5
Not Endogamous	7,723	1,470	3,317	6	0.08	38
100% Native American	1,115	Unlinked	Unlinked	Unlinked
100% Native American	885	290	Unknown	Can’t calculate without at least one link on both sides

The 100% Jewish, Acadian, and Not Endogamous testers have all linked their parents, so their matches, if valid (meaning not identical by chance, which I discussed here,) will match them plus one or the other parent.

One person is 75% Jewish and has only linked their Jewish mother.

The Native people have not tested their parents, and the first Native person has not linked anyone in their tree. The second Native person has only linked a few maternal matches, but their mother has not tested. They are seeking their father.

It’s very difficult to find people who are fully Native as testers. Furthermore, Native people are under-sampled. If anyone knows of fully Native (or other endogamous) people who have tested and linked their parents or known relatives in their trees, and will allow me to use their total match numbers anonymously, please let me know.

As you can see, Jewish, Acadian, and Native people are 100% endogamous, but many more Jewish people than Native people have tested, so you CAN’T judge endogamy by the total number of matches alone.

In fact, in order:

Fully Jewish testers have about 4-5 times as many matches as the Acadian and Non-endogamous testers
Acadian and Non-endogamous testers have about 5-6 times as many matches as the Native American testers
Fully Jewish people have about 30 times more matches than the Native American testers

If a person’s endogamy with a particular population is only on their maternal or paternal side, they won’t have a significant number of people related to both sides, meaning few people will fall into the “Both” bucket. People who will always be found in the “Both” bucket are full siblings and their descendants, along with descendants of the tester, assuming their match is linked to their profiles in the tester’s tree.

In the case of our Jewish testers, you can easily see that the “Both” bucket is very high. The Acadians are also higher than one would reasonably expect without endogamy. A non-endogamous person might have a few matches on both sides, assuming the parents are not related to each other.

A high number of “Both” matches is a very good indicator of endogamy within the same population on both parents’ sides.

The percentage of people who are assigned to the “Both” bucket is between 11% and 14% in the endogamous groups, and less than 1% in the non-endogamous group, so statistically not relevant.

As demonstrated by the Native people compared to the Jewish testers, the total number of matches can be deceiving.

However, being related to both parents, as indicated by the “Both” bucket, unless you have pedigree collapse, is a good indicator of endogamy.

Of course, if you don’t know who your relatives are, you can’t link them in your tree, so this type of “hunt” won’t generally help people seeking their close family members.

However, you may notice that you’re matching people PLUS both of their parents. If that’s the case, start asking questions of those matches about their heritage.

A very high number of total matches, as compared to non-endogamous people, combined with some other hints might well point to Jewish heritage.

I included the % DNA Unassigned category because this category, when both parents are linked, is the percentage of matches that are identical by chance, meaning the match doesn’t match either of the tester’s parents. All of the testers with matches listed in the “Both” category have linked both of their parents, not just maternal and paternal relatives.
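For readers who want to check the arithmetic on their own bucket counts, here is a small Python sketch. The function name is my own invention, and it assumes that “unassigned” simply means every match that landed in none of the three buckets. The sample numbers are the first 100% Jewish row of the table above.

```python
def bucket_percentages(total, maternal, paternal, both):
    """Return (% Both, % Unassigned) for one tester's bucketed match counts."""
    unassigned = total - (maternal + paternal + both)
    pct_both = round(100 * both / total, 1)
    pct_unassigned = round(100 * unassigned / total, 1)
    return pct_both, pct_unassigned


# First 100% Jewish tester from the table above
pct_both, pct_unassigned = bucket_percentages(34_637, 11_329, 10_416, 4_806)
print(pct_both, pct_unassigned)
```

A “% Both” value in the double digits, like this one, is the pattern to look for. A fraction of one percent is what the non-endogamous testers show.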

Matching Location at MyHeritage

MyHeritage provides a matching function by location. Please note that it’s the location of the tester, but that may still be quite useful.

The locations are shown in the most-matches to least-matches order. Clicking on the location shows the people who match you who are from that location. This would be the most useful in situations where recent immigration has occurred. In my case, my great-grandfather from the Netherlands arrived in the 1860s, and my German ancestors arrived in the 1850s. Neither of those groups is endogamous, though, unless it would be on a village level.

AutoClusters

Let’s shift to Genetic Affairs, a third-party tool available to everyone.

Using their AutoCluster function, Genetic Affairs clusters your matches together who match both each other and you.

This is an example of the first few clusters in my AutoCluster. You can see that I have several colored clusters of various sizes, but none are huge.

Compare that to the following endogamous cluster, sample courtesy of EJ Blom at Genetic Affairs.

If your AutoCluster at Genetic Affairs looks something like this, a huge orange blob in the upper left hand corner, you’re dealing with endogamy.
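To see why endogamy produces one giant cluster, here is a rough Python sketch. Genetic Affairs uses its own clustering algorithm. This stand-in simply groups matches into connected components of a “shared match” graph, and all of the names and sample data are made up for illustration.

```python
from collections import defaultdict


def cluster_matches(shared_pairs):
    """Group matches into clusters (connected components), largest first.

    shared_pairs is an iterable of (match_a, match_b) tuples, meaning those
    two people match each other as well as the tester.
    """
    graph = defaultdict(set)
    for a, b in shared_pairs:
        graph[a].add(b)
        graph[b].add(a)

    seen, clusters = set(), []
    for node in graph:
        if node in seen:
            continue
        stack, component = [node], set()
        while stack:
            current = stack.pop()
            if current in component:
                continue
            component.add(current)
            stack.extend(graph[current] - component)
        seen |= component
        clusters.append(component)
    return sorted(clusters, key=len, reverse=True)


# Two separate family lines: two clusters
non_endogamous = [("A", "B"), ("B", "C"), ("D", "E")]
# One extra cross-line connection merges everything into a single blob
endogamous = non_endogamous + [("C", "D")]

print([len(c) for c in cluster_matches(non_endogamous)])
print([len(c) for c in cluster_matches(endogamous)])
```

In an endogamous population, nearly everyone has those cross-line connections, so the separate family clusters collapse into one.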

Please note that the size of your cluster is also a function of both the number of testers and the match threshold you select. I always begin by using the defaults. I wrote about using Genetic Affairs, here.

If you tested at or transferred to MyHeritage, they too license AutoClusters, but have optimized the algorithm to tease out endogamous matches so that their Jewish customers, in particular, don’t wind up with a huge orange block of interrelated people.

You won’t see the “endogamy signature” huge cluster in the corner, so you’re less likely to be able to discern endogamy from a MyHeritage cluster alone.

The commonality between these Jewish clusters at MyHeritage is that they all tend to be rather uniform in size and small, with lots of grey connecting almost all the blocks.

Grey cells indicate people who match people in two colored groups. In other words, there is often no clear division in clusters between the mother’s side and the father’s side in Jewish clusters.

In non-endogamous situations, even if you can’t identify the parents, the clusters should still fall into two sides, meaning a group of clusters for each parent’s side that are not related to each other.

You can read more about Genetic Affairs clusters and their tools, here. DNAGedcom.com also provides a clustering tool.

Endogamous Relationships

Endogamous estimated relationships are sometimes high. Please note the word, “sometimes.”

Using the Shared cM Project tool relationship chart, here, at DNAPainter, people with heavy endogamy will discover that estimated relationships MAY be on the high side, or the relationships may, perhaps, be estimated too “close” in time. That’s especially true for more distant relationships, but surprisingly, it’s not always true. The randomness of inheritance still comes into play, and so do potential unknown relatives. Hence my emphasis on the word “may.”

Unfortunately, it’s often stated as “conventional wisdom” that Jewish matches are “always” high, and first cousins appear as siblings. Let’s see what the actual data says.

At DNAPainter, you can either enter the amount of shared DNA (cM), or the percent of shared DNA, or just use the chart provided.

I’ve assembled a compilation of close relationships in kits that I have access to or from people who were generous enough to share their results for this article.

I’ve used Jewish results, which is a highly endogamous population, compared with non-endogamous testers.

The “Jewish Actual” column reports the total amount of shared DNA with that person. In other words, someone to their grandparent. The Average Range is the average plus the range from DNAPainter. The Percent Difference is the % difference between the actual number and the DNAPainter average.

You’ll see fully Jewish testers, at left, matching with their family members, and a Non-endogamous person, at right, matching with their same relative.

Relationship	Jewish Actual	Percent Difference from Average	Average (Range)	Non-endogamous Actual	Percent Difference from Average
Grandparent	2141	22	1754 (984-2482)	1742	<1 lower
Grandparent	1902	8.5	1754 (984-2482)	1973	12
Sibling	3039	16	2613 (1613-3488)	2515	3.5 lower
Sibling	2724	4	2613 (1613-3488)	2761	5.5
Half-Sibling	2184	24	1759 (1160-2436)	2127	21
Half-Sibling	2128	21	1759 (1160-2436)	2352	34
Aunt/Uncle	2066	18.5	1741 (1201-2282)	1849	6
Aunt/Uncle	2031	16.5	1741 (1201-2282)	2097	20
1C	1119	29	866 (396-1397)	959	11
1C	909	5	866 (396-1397)	789	9 lower
1C1R	514	19	433 (102-980)	467	8
1C1R	459	6	433 (102-980)	395	9 lower

These totals are from FamilyTreeDNA except one from GEDMatch (one Jewish Half-sibling).
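If you’d like to run the same comparison on your own matches, the percent difference is easy to compute. This is a minimal Python sketch with function names of my own choosing. The averages and ranges are the DNAPainter figures shown in the table, and the results are rounded to one decimal place, so they may differ slightly from the rounded values in the table.

```python
def percent_difference(actual_cm, average_cm):
    """Signed percent difference of an actual match from the published average."""
    return round(100 * (actual_cm - average_cm) / average_cm, 1)


def within_range(actual_cm, low_cm, high_cm):
    """True if the actual shared cM falls inside the published range."""
    return low_cm <= actual_cm <= high_cm


# (relationship, actual cM, average cM, range low, range high), Jewish column
rows = [
    ("Grandparent", 2141, 1754, 984, 2482),
    ("1C", 1119, 866, 396, 1397),
    ("1C1R", 459, 433, 102, 980),
]

for relationship, actual, average, low, high in rows:
    print(
        relationship,
        percent_difference(actual, average),
        within_range(actual, low, high),
    )
```

A negative result means the match is lower than average, like the non-endogamous 1C who shares 789 cM.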

Totals may vary by vendor, even when matching with the same person. 23andMe includes the X segments in the total cMs and also counts fully identical segments twice. MyHeritage imputation seems to err on the generous side.

However, in these dozen examples:

You can see that the Jewish actual amount of DNA shared is always more than the average in the estimate.
The red means the overage is more than 100 cM larger.
The percentage difference is probably more meaningful because 100 cM is a smaller percentage of a 1754 cM grandparent connection than of a 433 cM 1C1R connection.

However, you can’t tell anything about endogamy by just looking at any one sample, because:

Some of the Non-Endogamous matches are high too. That’s just the way of random inheritance.
All of the actual Jewish match numbers are within the published ranges, but on the high side.

Furthermore, it can get more complex.

Half Endogamous

I requested assistance from Jewish genealogy researchers, and a lovely lady, Sharon, reached out, compiled her segment information, and shared it with me, granting permission to share with you. A HUGE thank you to Sharon!

Sharon is half-Jewish via one parent, and her half-sibling is fully Jewish. Their half-sibling match to each other at Ancestry is 1756 cM with a longest segment of 164 cM.

How does Jewish matching vary if you’re half-Jewish versus fully Jewish? Let’s look at 21 people who match both Sharon and her fully Jewish half-sibling.

Sharon shared the differences in 21 known Jewish matches with her and her half-sibling. I’ve added the Relationship Estimate Range from DNAPainter and colorized the highest of the two matches in yellow. Bolding in the total cM column shows a value above the average range for that relationship.

Total Matching cMs is on the left, with Longest Segment on the right.

While this is clearly not a scientific study, it is a representative sample.

The fully Jewish sibling carries more Jewish DNA, which is available for other Jewish matches to match as a function of endogamy (identical by chance/population), so I would have expected the fully Jewish sibling to match most if not all Jewish testers at a higher level than the half-Jewish sibling.

However, that’s not universally what we see.

The fully Jewish sibling is not always the sibling with the higher total shared cM with the other Jewish testers, and the half-Jewish tester has the larger “Longest Segment” more often than not.

Approximately two-thirds of the time (13/21), the fully Jewish person does have a higher total matching cM, but about one-third of the time (8/21), the half-Jewish sibling has a higher matching cM.

About one-fourth of the time (5/21), the fully Jewish sibling has the longest matching segment, and about two-thirds of the time (13/21), the half-Jewish sibling does. In three cases, or about 14% of the time, the longest segment is equal, which may indicate that it’s the same segment.

Because of endogamy, Jewish matches are more likely to have:

Larger than average total cM for the specific relationship
More and smaller matching segments

However, as we have seen, neither of those is definitive, nor always true. Jewish matches and relationships are not always overestimated.

Ancestry and Timber

Please note that Ancestry downweights some matches by removing some segments using their Timber algorithm. Based on my matches and other accounts that I manage, Ancestry does not downweight in the 2nd-3rd cousin category, which is 90 cM and above, but they do begin downweighting in the 3rd-4th cousin category, below 90 cM, where my “Extended Family” category begins.

If you’ve tested at Ancestry, you can check for yourself.

By clicking on the amount of DNA you share with your match on your match list at Ancestry, shown above, you will be taken to another page where you will be able to view the unweighted shared DNA with that match, meaning the amount of DNA shared before the downweighting and removal of some segments, shown below.

Given the downweighting, and the information in the spreadsheet provided by Sharon, it doesn’t appear that any of those matches would have been in a category to be downweighted.

Therefore, for these and other close matches, Timber wouldn’t be a factor, but would potentially be in more distant matches.

Endogamous Segments

Endogamous matches tend to have smaller and more segments. Small amounts of matching DNA tend to skew the total DNA cM upwards.

How and why does this happen?

Ancestral DNA from further back in time tends to be broken into smaller segments.

Sometimes, especially in endogamous situations, two smaller segments, at one time separated from each other, manage to join back together again and form a match, but the match is only due to ancestral segments – not because of a recent ancestor.

Please note that different vendors have different minimum matching cM thresholds, so smaller matches may not be available at all vendors. Remember that factors like Timber and imputation can affect matching as well.

Let’s take a look at an example. I’ve created a chart where two ancestors have their blue and pink DNA broken into 4 cM segments.

They have children, a blue child and a pink child, and the two children, shown above, each inherited the same blue 4 cM segment and the same pink 4 cM segment from their respective parents. The other unlabeled pink and blue segments are not inherited by these two children, so those unlabeled segments are irrelevant in this example.

The parents may have had other children who inherited those same 4 cM labeled pink and blue segments as well, and if not, the parents’ siblings were probably passing at least some of the same DNA down to their descendants too.

The blue and pink children had children, and their children had children – for several generations.

Time passed, and their descendants became an endogamous community. Those pink and blue 4 cM segments may at some time be lost during recombination in the descendants of each of their children, shown by “Lost pink” and “Lost blue.”

However, because there is only a very limited amount of DNA within the endogamous community, their descendants may regain those same segments again from their “other parent” during recombination, downstream.

In each generation, the DNA of the descendant carrying the original blue or pink DNA segment is recombined with their partner. Given that the partners are both members of the same endogamous community, the two people may have the same pink and/or blue DNA segments. If one parent doesn’t carry the pink 4 cM segment, for example, their offspring may receive that ancestral pink segment from the other parent.

They could potentially, and sometimes do, receive that ancestral segment from both parents.

In our example, the descendants of the blue child, at left, lost the pink 4 cM segment in generation 3, but a few generations later, in generation 11, that descendant child inherited that same pink 4 cM segment from their other parent. Therefore, both the 4 cM blue and 4 cM pink segments are now available to be inherited by the descendants in that line. I’ve shown the opposite scenario in the generational inheritance at right where the blue segment is lost and regained.

Once rejoined, that pink and blue segment can be passed along together for generations.

The important part, though, is that once those two segments butt up against each other again during recombination, they aren’t just two separate 4 cM segments, but one segment that is 8 cM long – that is now equal to or above the vendors’ matching threshold.
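The effect of two small segments rejoining can be sketched in a few lines of Python. This is only an illustration. The positions are made-up cM coordinates on a single chromosome, the 8 cM cutoff is an assumed threshold (vendors differ), and the function names are mine.

```python
def merge_adjacent(segments):
    """Merge (start_cM, end_cM) segments that touch or overlap."""
    merged = []
    for start, end in sorted(segments):
        if merged and start <= merged[-1][1]:
            # This segment butts up against the previous one, so extend it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def reportable(segments, threshold_cm=8.0):
    """Segments long enough to show up as a match after merging."""
    return [(s, e) for s, e in merge_adjacent(segments) if e - s >= threshold_cm]


# Blue and pink 4 cM segments in different places: neither reaches the threshold
apart = [(10.0, 14.0), (30.0, 34.0)]
# The same two segments side by side after recombination: one 8 cM segment
rejoined = [(10.0, 14.0), (14.0, 18.0)]

print(reportable(apart))
print(reportable(rejoined))
```

The separated segments produce no match at all, while the rejoined pair is reported as a single 8 cM segment, even though no recent ancestor is involved.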

This is why people descended from endogamous populations often have the following matching characteristics:

More matches
Many smaller segment matches
Their total cM is often broken into more, smaller segments

What do more, smaller segments look like, exactly?

More, Smaller Segments

All of our vendors except Ancestry have a chromosome browser for their customers to compare their DNA to that of their matches visually.

Let’s take a look at some examples of what endogamous and non-endogamous matches look like.

For example, here’s a screenshot of a random Jewish second cousin match – 298 cM total, divided into 12 segments, with a longest segment of 58 cM.

A second Jewish 2C with 323 cM total, across 19 segments, with a 69 cM longest block.

A fully Acadian 2C match with 600 cM total, across 27 segments, with a longest segment of 69 cM.

A second Acadian 2C with 332 cM total, across 20 segments, with a longest segment of 42 cM.

Next, a non-endogamous 2C match with 217 cM, across 7 segments, with a longest segment of 72 cM.

Here’s another non-endogamous 2C example, with 169 shared cM, across 6 segments, with a longest segment of 70 cM.

Here’s the second cousin data in a summary table. The takeaway is the number of total segments relative to the total cM.

Tester Population	Total cM	Longest Block	Total Segments
Jewish 2C	298	58	12
Jewish 2C	323	69	19
Acadian 2C	600	69	27
Acadian 2C	332	42	20
Non-endogamous 2C	217	72	7
Non-endogamous 2C	169	70	6
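One simple way to put numbers on “more, smaller segments” is to compute the average segment size and the share of the total that sits in the longest block. This Python sketch uses the six second cousins from the table above. The two measures and the function name are my own choices for illustration, not an established endogamy test.

```python
def segment_profile(total_cm, longest_cm, segments):
    """Average segment size and longest block as a percentage of total cM."""
    return {
        "avg_segment_cm": round(total_cm / segments, 1),
        "longest_share_pct": round(100 * longest_cm / total_cm, 1),
    }


# (label, total cM, longest block, total segments) from the table above
second_cousins = [
    ("Jewish 2C", 298, 58, 12),
    ("Jewish 2C", 323, 69, 19),
    ("Acadian 2C", 600, 69, 27),
    ("Acadian 2C", 332, 42, 20),
    ("Non-endogamous 2C", 217, 72, 7),
    ("Non-endogamous 2C", 169, 70, 6),
]

for label, total_cm, longest_cm, segments in second_cousins:
    print(label, segment_profile(total_cm, longest_cm, segments))
```

In this small sample, the non-endogamous matches have larger average segments, and a much larger share of their total is in the longest block. Six matches are far too few to draw firm conclusions.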

You can see more examples and comparisons between Native American, Jewish and non-endogamous DNA individuals in the article, Concepts – Endogamy and DNA Segments.

I suspect that a savvy mathematician could predict endogamy based on longest block and total segment information.

Lara Diamond, a mathematician who writes at Lara’s Jewnealogy, might be up for this challenge. She just published compiled matching and segment information in her Ashkenazic Shared DNA Survey Results for those who are interested. You can also contribute to Lara’s data, here.

Endogamy, Segments, and Distant Relationships

While not relevant to searching for close relatives, heavily endogamous matches 3C and more distant, to quote one of my Jewish friends, “dissolve into a quagmire of endogamy and are exceedingly difficult to unravel.”

In my own Acadian endogamous line, I often simply have to label them “Acadian” because the DNA tracks back to so many ancestors in different lines. In other words, I can’t tell which ancestor the match is actually pointing to because the same DNA segment or segments is/are carried by several ancestors and their descendants due to founder effect.

The difference with the Acadians is that we can actually identify many or most of them, at least at some point in time. As my cousin, Paul LeBlanc, once said, if you’re related to one Acadian, you’re related to all Acadians. Then he proceeded to tell me that he and I are related 137 different ways. My head hurts!

It’s no wonder that endogamy is incredibly difficult beyond the first few generations when it turns into something like multi-colored jello soup.

“Are Your Parents Related?” Tool

There’s another tool that you can utilize to determine if your parents are related to each other.

To determine if your parents are related to each other, you need to know about Runs of Homozygosity (ROH).

ROH means that the DNA on both strands or copies of the same chromosome is identical.

For a few locations in a row, ROH can easily happen just by chance, but the longer the segment, the less likely that commonality occurs simply by chance.

The good news is that you don’t need to know the identity of either of your parents. You don’t need either of your parents’ DNA tests – just your own. You’ll need to upload your DNA file to GEDmatch, which is free.

Click on “Are your parents related?”

GEDMatch analyzes your DNA to see if any of your DNA, above a reasonable matching threshold, is identical on both strands, indicating that you inherited the exact same DNA from both of your parents.

A legitimate match, meaning one that’s not by chance, will include many contiguous matching locations, generally a minimum of 500 SNPs or locations in a row. GEDmatch’s minimum threshold for identifying identical ancestral DNA (ROH) is 200 contiguous SNPs.
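For the technically curious, the basic idea behind an ROH scan can be sketched in Python. This is not GEDmatch’s code. The input format, the function name, and the default cutoffs of 200 SNPs and 7 cM are assumptions for illustration, and a real tool would also have to tolerate no-calls and genotyping errors.

```python
def find_roh(snps, min_snps=200, min_cm=7.0):
    """Find runs of homozygosity on one chromosome.

    snps is a list of (position_cM, allele_a, allele_b) tuples sorted by
    position. Returns (start_cM, end_cM, snp_count) for each run in which
    every SNP has two identical alleles and both cutoffs are met.
    """
    runs = []
    current = []  # positions of the homozygous SNPs in the current run
    # A heterozygous sentinel at the end flushes the final run.
    for position, a, b in list(snps) + [(None, "A", "B")]:
        if a == b:
            current.append(position)
            continue
        if len(current) >= min_snps and current[-1] - current[0] >= min_cm:
            runs.append((current[0], current[-1], len(current)))
        current = []
    return runs


# 300 homozygous SNPs spanning about 15 cM, one heterozygous SNP, then a
# short homozygous stretch that is too small to count
snps = (
    [(i * 0.05, "A", "A") for i in range(300)]
    + [(15.0, "A", "G")]
    + [(15.05 + i * 0.05, "C", "C") for i in range(50)]
)
runs = find_roh(snps)
print(len(runs), runs[0][2])
```

Short stretches like the final 50 SNPs are the identical-by-chance slivers. Only the long run would suggest that both parents passed down the same ancestral segment.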

Here’s my result, including the graphic for the first two chromosomes. Notice the tiny green bars that show identical by chance tiny sliver segments.

I have no significant identical DNA, meaning my parents are not related to each other.

Next, let’s look at an endogamous example where there are small, completely identical segments across a person’s chromosome.

This person’s Acadian parents are related to each other, but distantly.

Next, let’s look at a Jewish person’s results.

You’ll notice larger green matching ROH, but not over 200 contiguous SNPs and 7 cM.

GEDMatch reports that this Jewish person’s parents are probably not related within recent generations, but it’s clear that they do share DNA in common.

People whose parents are distantly related have relatively small, scattered matching segments. However, if you’re seeing larger ROH segments that would be large enough to match in a genealogical setting, meaning multiple segments greater than 7 cM and 500 SNPs, you may be dealing with a different type of situation where cousins have married in recent generations. The larger the matching segments, generally, the closer in time.

Blogger Kitty Cooper wrote an article, here, about discovering that your parents are related at the first cousin level, and what their GEDMatch “Are Your Parents Related” results look like.

Let’s look for more clues.

Surnames

There MAY be an endogamy clue in the surnames of the people you match.

Viewing surnames is easier if you download your match list, which you can do at every vendor except Ancestry. I’m not referring to the segment data, but the information about your matches themselves.

I provided instructions in the recent article, How to Download Your DNA Match Lists and Segment Files, here.

If you suspect endogamy for any reason, look at your closest matches and see if there is a discernable trend in the surnames, or locations, or any commonality between your matches to each other.

For example, Jewish, Acadian, and Native surnames may be recognizable, as may locations.

You can evaluate in either or both of two ways:

The surnames of your closest matches. Closest matches listed first will be your default match order.
Your most frequently occurring surnames, minus extremely common names like Smith, Jones, etc., unless they are also in your closest matches. To utilize this type of matching, sort the spreadsheet in surname order and then scan or count the number of people with each surname.
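If you’re comfortable with a little Python, the second approach can be automated instead of counted by hand. This is a minimal sketch. The short list of common surnames and the function name are assumptions for illustration, and you would load the surnames from whichever column your vendor’s download file uses.

```python
from collections import Counter

# Illustrative only; extend this with whatever names are common in your area
COMMON_SURNAMES = {"smith", "jones", "johnson", "williams", "brown"}


def surname_counts(surnames, closest=(), common=COMMON_SURNAMES):
    """Count surnames, most frequent first.

    Very common surnames are skipped unless they also appear among the
    surnames of your closest matches.
    """
    keep = {name.strip().lower() for name in closest}
    counts = Counter()
    for name in surnames:
        name = name.strip()
        if not name:
            continue
        key = name.lower()
        if key in common and key not in keep:
            continue
        counts[name] += 1
    return counts.most_common()


surnames = ["Bergeron", "Hebert", "Bergeron", "Smith", "Muise", "Smith"]
print(surname_counts(surnames))
print(surname_counts(surnames, closest=["Smith"]))
```

The first call drops Smith entirely. The second keeps it because it was passed in as a closest-match surname.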

Here are some examples from our testers.

Jewish – Closest surname matches.

Roth
Weiss
Goldman
Schonwald
Levi
Cohen
Slavin
Goodman
Sender
Trebatch

Acadian – Closest surname matches.

Bergeron
Hebert
Bergeron
Marcum
Muise
Legere
Gaudet
Perry
Verlander
Trombley

Native American – Closest surname matches.

Ortega
Begay
Valentine
Hayes
Montoya
Sun Bear
Martin
Tsosie
Chiquito
Yazzie

You may recognize these categories of surnames immediately.

If not, Google is your friend. Eliminate common surnames, then Google for a few together at a time and see what emerges.

The most unusual surnames are likely your best bets.

Projects

Another way to get some idea of what groups people with these surnames might belong to is to enter the surname in the FamilyTreeDNA surname search.

Go to the main FamilyTreeDNA page, but DO NOT sign on.

Scroll down until you see this image.

Type the surname into the search box. You’ll see how many people have tested with that surname, along with projects where project administrators have included that surname indicating that the project may be of interest to at least some people with that surname.

Here’s a portion of the project list for Cohen, a traditional Jewish surname.

These results are for Muise, an Acadian surname.

Clicking through to relevant surname projects, and potentially contacting the volunteer project administrator can go a very long way in helping you gather and sift information. Clearly, they have an interest in this topic.

For example, here’s the Muise surname in the Acadian AmerIndian project. Two great hints here – Acadian heritage and Halifax, Nova Scotia.

Repeat for the balance of surnames on your list to look for commonalities, including locations on the public project pages.

Locations

Some of the vendor match files include location information. Each person on your match list will have the opportunity at the vendor where they tested to include location information in a variety of ways, either for their ancestors or themselves.

Where possible, it’s easiest to sort or scan the download file for this type of information.

Ancestry does not provide or facilitate a match list download, but you can still create your own for your closest 20 or 30 matches in a spreadsheet.

MyHeritage provides common surname and ancestral location information for every match. How cool is that!

Y DNA, Mitochondrial DNA, and Endogamy

Haplogroups for both Y and mitochondrial DNA can indicate and sometimes confirm endogamy. In other cases, the haplogroup won’t help, but the matches and their location information just might.

FamilyTreeDNA is the only vendor that provides Y DNA and mitochondrial DNA tests that include highly granular haplogroups along with matches and additional tools.

23andMe provides high-level haplogroups which may or may not be adequate to pinpoint a haplogroup that indicates endogamy.

Of course, only males carry Y DNA that tracks to the direct paternal (surname) line, but everyone carries their mother’s mitochondrial DNA that represents their mother’s mother’s mother’s, or direct matrilineal line.

Some haplogroups are known to be closely associated with particular ethnicities or populations, like Native Americans, Pacific Islanders, and some Jewish people.

Haplogroups reach back in time before genealogy and can give us a sense of community that’s not available by either looking in the mirror or through traditional records.

This Native American man is a member of high-level haplogroup Q-M242. However, some men who carry this haplogroup are not Native, but are of European or Middle Eastern origin.

I entered the haplogroup in the FamilyTreeDNA Discover tool, which I wrote about, here.

Checking the information about this haplogroup reveals that their common ancestor descended from an Asian man about 30,000 years ago.

The migration path in the Americas explains why this person would have an endogamous heritage.

Our tester would receive a much more refined haplogroup if he upgraded to the Big Y test at FamilyTreeDNA, which would remove all doubt.

However, even without additional testing, information about his matches at FamilyTreeDNA may be very illuminating.

The Q-M242 Native man’s Y DNA matches men with more granular haplogroups, shown above, at left. On the Haplogroup Origins report, you can see that these people have all selected the “US (Native American)” country option.

Another useful tool would be to check the public Y haplotree, here, and the public mitochondrial tree here, for self-reported ancestor location information for a specific haplogroup.

Here’s an example of mitochondrial haplogroup A2 and a few subclades on the public mitochondrial tree. You can see that the haplogroup is found in Mexico, the US (Native,) Canada, and many additional Caribbean, South, and Central American countries.

Of course, Y DNA and mitochondrial DNA (mtDNA) tell a laser-focused story of one specific line, each. The great news, if you’re seeking information about your mother or father, is that the Y is your father’s direct paternal (surname) line, and mitochondrial is your mother’s direct matrilineal line.

With Y and mitochondrial DNA results combined with ethnicity, autosomal matching, and the wide range of other tools that open doors, you will be able to reveal a great deal of information about whether you have endogamous heritage or not – and if so, from where.

I’ve provided a resource for stepping through and interpreting your Y DNA results, here, and mitochondrial DNA, here.

Discover for Y DNA Only

If you’re a female, you may feel left out of Y DNA testing and what it can tell you about your heritage. However, there’s a back door.

You can utilize the Y DNA haplogroups of your closest autosomal matches at both FamilyTreeDNA and 23andMe to reveal information.

Haplogroup information is available in the download files for both vendors, in addition to the Family Finder table view, below, at FamilyTreeDNA, or on your individual matches’ profile cards at both 23andMe and FamilyTreeDNA.

You can enter any Y DNA haplogroup in the FamilyTreeDNA Discover tool, here.

You’ll be treated to:

Your Haplogroup Story – how many testers have this haplogroup (so far), where the haplogroup is from, and the haplogroup’s age. In this case, the haplogroup was born in the Netherlands about 250 years ago, give or take 200 years. I know that it was 1806 or earlier based on the common ancestor of the men who tested.
Country Frequency – heat map of where the haplogroup is found in the world.
Notable Connections – famous and infamous (this haplogroup’s closest notable person is Leo Tolstoy).
Migration Map – migration path out of Africa and through the rest of the world.
Ancient Connections – ancient burials. His closest ancient match is from about 1000 years ago in Ukraine. Their shared ancestor lived about 2000 years ago.
Suggested Projects – based on the surname, projects that other matches have joined, and haplogroups.
Scientific Details – age estimates, confidence intervals, graphs, and the mutations that define this haplogroup.

I wrote about the Discover tool in the article, FamilyTreeDNA DISCOVER Launches – Including Y DNA Haplogroup Ages.

Endogamy Tools Summary Tables

Endogamy is a tough nut sometimes, especially if you’re starting from scratch. In order to make this topic a bit easier and to create a reference tool for you, I’ve created three summary tables.

Various endogamy-related tools available at each vendor which will or may assist with evaluating endogamy
Tools and their ability to detect endogamy in different groups
Tools best suited to assist people seeking information about unknown parents or grandparents

Summary of Endogamy Tools by Vendor

Please note that GEDMatch is not a DNA testing vendor, but they accept uploads and do have some tools that the testing vendors do not.

Tool	23andMe	Ancestry	FamilyTreeDNA	MyHeritage	GEDMatch
Ethnicity	Yes	Yes	Yes	Yes	Use the vendors
Ethnicity Painting	Yes + segments	Yes, limited	Yes + segments	Yes
Ethnicity Phasing	Yes	Partial	Yes	No
DNA Communities	No	Yes	No	No
Genetic Groups	No	No	No	Yes
Family Matching aka Bucketing	No	No	Yes	No
Chromosome Browser	Yes	No	Yes	Yes	Yes
AutoClusters	Through Genetic Affairs	No	Through Genetic Affairs	Yes, included	Yes, with subscription
Match List Download	Yes, restricted # of matches	No	Yes	Yes	Yes
Projects	No	No	Yes	No
Y DNA	High-level haplogroup only	No	Yes, full haplogroup with Big Y, matching, tools, Discover	No
Mitochondrial DNA	High-level haplogroup only	No	Yes, full haplogroup with mtFull, matching, tools	No
Public Y Tree	No	No	Yes	No
Public Mito Tree	No	No	Yes	No
Discover Y DNA – public	No	No	Yes	No
ROH	No	No	No	No	Yes

Summary of Endogamous Populations Identified by Each Tool

The following chart provides a guideline for which tools are useful for the following types of endogamous groups. Bolded tools require that both parents be descended from the same endogamous group, but several other tools give more definitive results with higher amounts of endogamy.

Y and mitochondrial DNA testing are not affected by admixture, autosomal DNA or anything from the “other” parent.

Tool	Jewish	Acadian	Anabaptist	Native	Other/General
Ethnicity	Yes	No	No	Yes	Pacific Islander
Ethnicity Painting	Yes	No	No	Yes	Pacific Islander
Ethnicity Phasing	Yes, if different	No	No	Yes, if different	Pacific Islander, if different
DNA Communities	Yes	Possibly	Possibly	Yes	Pacific Islander
Genetic Groups	Yes	Possibly	Possibly	Yes	Pacific Islander
Family Matching aka Bucketing	Yes	Yes	Possibly	Yes	Pacific Islander
Chromosome Browser	Possibly	Possibly	Yes, once segments or ancestors identified	Possibly	Pacific Islander, possibly
Total Matches	Yes, compared to non-endogamous	No	No	No	No, unknown
AutoClusters	Yes	Yes	Uncertain, probably	Yes	Pacific Islander
Estimated Relationships High	Not always	Sometimes	No	Sometimes	Uncertain, probably
Relationship Range High	Possibly, sometimes	Possibly	Possibly	Possibly	Pacific Islander, possibly
More, Smaller Segments	Yes	Yes	Probably	Yes	Pacific Islander, probably
Parents Related	Some but minimal	Possibly	Uncertain	Probably similar to Jewish	Uncertain, Possibly
Surnames	Probably	Probably	Probably Not	Possibly	Possibly
Locations	Possibly	Probably	Probably Not	Probably	Probably Pacific Islander
Projects	Probably	Probably	Possibly	Possibly	Probably Pacific Islander
Y DNA	Yes, often	Yes, often	No	Yes	Pacific Islander
Mitochondrial DNA	Yes, often	Sometimes	No	Yes	Pacific Islander
Y public tree	Probably not alone	No	No	Yes	Pacific Islander
MtDNA public tree	Probably not	No	No	Yes	Pacific Islander
Y DNA Discover	Yes	Possibly	Probably not, maybe projects	Yes	Pacific Islander

Summary of Endogamy Tools to Assist People Seeking Unknown Parents and Grandparents

This table provides a summary of when each of the various tools can be useful to:

People seeking unknown close relatives
People who already know who their close relatives are, but are seeking additional information or clues about their genealogy

I considered rating these on a 1 to 10 scale, but the relative usefulness of these tools is dependent on many factors, so different tools will be more or less useful to different people.

For example, ethnicity is very useful if someone is admixed from different populations, or even 100% of a specific endogamous population. It’s less useful if the tester is 100% European, regardless of whether they are seeking close relatives or not. Conversely, even “vanilla” ethnicity can be used to rule out majority or recent admixture with many populations.

Tools	Unknown Close Relative Seekers	Known Close Relatives – Enhance Genealogy
Ethnicity	Yes, to identify or rule out populations	Yes
Ethnicity Painting	Yes, possibly, depending on population	Yes, possibly, depending on population
Ethnicity Phasing	Yes, possibly, depending on population	Yes, possibly, depending on population
DNA Communities	Yes, possibly, depending on population	Yes, possibly, depending on population
Genetic Groups	Possibly, depending on population	Possibly, depending on population
Family Matching aka Bucketing	Not if parents are entirely unknown, but yes if one parent is known	Yes
Chromosome Browser	Unlikely	Yes
AutoClusters	Yes	Yes, especially at MyHeritage if Jewish
Estimated Relationships High	No	No
Relationship Range High	Not reliably	No
More, Smaller Segments	Unlikely	Unlikely other than confirmation
Match List Download	Yes	Yes
Surnames	Yes	Yes
Locations	Yes	Yes
Projects	Yes	Yes
Y DNA	Yes, males only, direct paternal line, identifies surname lineage	Yes, males only, direct paternal line, identifies and correctly places surname lineage
Mitochondrial DNA	Yes, both sexes, direct matrilineal line only	Yes, both sexes, direct matrilineal line only
Public Y Tree	Yes for locations	Yes for locations
Public Mito Tree	Yes for locations	Yes for locations
Discover Y DNA	Yes, for heritage information	Yes, for heritage information
Parents Related – ROH	Possibly	Less useful

Acknowledgments

A HUGE thank you to several people who contributed images and information in order to provide accurate and expanded information on the topic of endogamy. Many did not want to be mentioned by name, but you know who you are!!!

If you have information to add, please post in the comments.


FamilyTreeDNA Relaunch – New Feature Overview

Posted on July 1, 2021 by Roberta Estes

The brand-new FamilyTreeDNA website is live!

I’m very pleased with the investment that FamilyTreeDNA has made in their genealogy platform and tools. This isn’t just a redesign, it’s more of a relaunch.

I spoke briefly yesterday with Dr. Lior Rauchberger, CEO of myDNA, the parent company of FamilyTreeDNA. He’s excited too and said:

“The new features and enhancements we are releasing in July are the first round of updates in our exciting product roadmap. FamilyTreeDNA will continue to invest heavily in the advancement of genetic genealogy.”

In other words, this is just the beginning.

In case you were wondering, all those features everyone asked for – Lior listened.

Lior said earlier in 2021 that he was going to do exactly this and he’s proven true to his word, with this release coming just half a year after he took the helm. Obviously, he hit the ground running.

A few months ago, Lior said that his initial FamilyTreeDNA focus was going to be on infrastructure, stability, and the customer experience. In other words, creating a foundation to build on.

The new features, improvements, and changes are massive and certainly welcome.

I’ll be covering the new features in a series of articles, but in this introductory article, I’m providing an overview so you can use it as a guide to understand and navigate this new release.

Change is Challenging

I need to say something here.

Change is hard. In fact, change is the most difficult challenge for humans. We want improvements, yet we hate it when the furniture is rearranged in our “room.” However, we can’t have one without the other.

So, take a deep breath, and let’s view this as a great new adventure. These changes and tools will provide us with a new foundation and new clues. Think of this as finding long-lost documents in an archive about your ancestors. If someone told me that there is a potential for discovering the surname of one of my elusive female ancestors in an undiscovered chest in a remote library, trust me, I’d be all over it – regardless of where it was or how much effort I had to expend to get there. In this case, I can sit right here in front of my computer and dig for treasure.

We just need to learn to navigate the new landscape in a virtual room. What a gift!

Let’s start with the first thing you’ll see – the main page when you sign in.

Redesigned Main Page

The FamilyTreeDNA main page has changed. To begin with, the text is darker and the font is larger across the entire platform. OMG, thank you!!!

The main page has been flipped left to right, with results on the left now. Projects, surveys, and other information, along with haplogroup badges are on the right. Have you answered any surveys? I don’t think I even noticed them before. (My bad!)

Click any image to enlarge.

The top tabs have changed too. The words myTree and myProjects are now gone, and descriptive tabs have replaced those. The only “my” thing remaining is myOrigins. This change surprises me with myDNA being the owner.

The Results & Tools tab at the top shows the product dropdowns.

The most popular tabs are shown individually under each product, with additional features being grouped under “See More.”

Every product now has a “See More” link where less frequently used widgets will be found, including the raw data downloads. This is the Y DNA “See More” dropdown by way of example.

You can see the green Updated badge on the Family Finder Matches tab. I don’t know if that badge will always appear when customers have new matches, or if it’s signaling that all customers have updated Family Finder Matches now.

We’ll talk about matches in the Family Finder section.

The Family Finder “See More” tab includes the Matrix, ancientOrigins, and the raw data file download.

The mitochondrial DNA section, titled Maternal Line Ancestry, mtDNA Results and Tools includes several widgets grouped under the “See More” tab.

Additional Tests and Tools

The Additional Tests and Tools area includes a link to your Family Tree (please do upload or create one,) Public Haplotrees, and Advanced Matches.

Public haplotrees are free-to-the-public Y and mitochondrial DNA trees that include locations. They are also easily available to FamilyTreeDNA customers here.

Please note that you access both types of trees from one location after clicking the Public Haplotrees page. The tree defaults to Y-DNA, but just click on mtDNA to view mitochondrial haplogroups and locations. Both trees are great resources because they show the location flags of the earliest known ancestors of the testers within each haplogroup.

Advanced Matches used to be available from the menu within each test type, but since advanced matching includes all three types of tests, it’s now located under the Additional Tests and Tools banner. Don’t forget about Advanced Matches – it’s really quite useful to determine if someone matches you on multiple types of tests and/or within specific projects.

Hey, look – I found a tooltip. Just mouse over the text and tabs on various pages to see where tooltips have been added.

Help and Help Center

The new Help Center is debuting in this release. The former Learning Center is transitioning to the Help Center with new, updated content.

Here’s an example of the new easy-to-navigate format. There’s a search function too.

Each individual page, test type, and section on your personal home page has a “Helpful Information” button.

On the main page, at the top right, you’ll see a new Help button.

Did you see that Submit Feedback link?

If you click on the Help Center, you’ll be greeted with context-sensitive help.

I clicked through from the dashboard, so that’s what I’m seeing. However, other available topics are shown at left.

I clicked on both of the links shown and the content has been updated with the new layout and features. No wonder they launched a new Help Center!

Account Settings

Account settings are still found in the same place, and those pages don’t appear to have changed. However, please keep in mind that some settings may take up to 24 hours to take effect.

Family Finder Rematching

Before we look at what has changed on your Family Finder pages, let’s talk about what happened behind the scenes.

FamilyTreeDNA has been offering the Family Finder test for 11 years, one of two very early companies to enter that marketspace. We’ve learned so much since then, not only about DNA itself, but about genetic genealogy, matching, triangulation, population genetics, how to use these tools, and more.

In order to make improvements, FamilyTreeDNA changed the match criteria, which necessitated rematching everyone to everyone else.

If you have a technology background of any type, you’ll immediately realize that this is a massive, expensive undertaking requiring vast computational resources. Not only that, but the rematching has to be done in tandem with new kits coming in, coordinated for all customers, and rolled out at once. Based on new matches and features, the user interface needed to be changed too, at the same time.

Sounds like a huge headache, right?

Why would a company ever decide to undertake that, especially when there is no revenue for doing so? The answer is to make functionality and accuracy better for their customers. Think of this as a new bedrock foundation for the future.

FamilyTreeDNA has made computational changes and implemented several features that require rematching:

Improved matching accuracy, in particular for people in highly endogamous populations. People in this category have thousands of matches that occur simply because they share multiple distant ancestors from within the same population. That combination of multiple common ancestors makes their current match relationships appear to be closer in time than they are. In order to change matching algorithms, FamilyTreeDNA had to rewrite their matching software and then run matching all over to enable everyone to receive new, updated match results.
FamilyTreeDNA has removed segments below 6 cM following sustained feedback from the genealogical community.
X matching has changed as well and no longer includes anyone as an X match below 6 cM.
Family Matching, meaning paternal, maternal, and both “bucketing,” uses triangulation behind the scenes. That code also had to be updated.
Older transfer kits used to receive only closer matches because imputation was not in place when the original transfer/upload took place. All older kits have been imputed now and matched with the entire database, which is part of why you may have more matches.
Relationship range calculations have changed, based on the removal of microsegments, new matching methodology and rematching results.
FamilyTreeDNA moved to hg37, known as Build 37 of the human genome. In layman’s terms, as scientists learn about our DNA, the human map of DNA changes and shifts slightly. The boundary lines change somewhat. Versions are standardized so all researchers can use the same base map or yardstick. In some cases, early genetic genealogy implementers are penalized because they will eventually have to rematch their entire database when they upgrade to a new build version, while vendors who came to the party later won’t have to bear that internal expense.
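
To make the effect of the 6 cM threshold concrete, here is a minimal Python sketch. The segment sizes and the function name are invented for illustration. This is not FamilyTreeDNA's matching code, just the arithmetic of dropping small segments before totaling shared DNA.

```python
# Hypothetical illustration only, not FamilyTreeDNA's matching code.
MIN_SEGMENT_CM = 6.0  # threshold described in the list above


def shared_dna_total(segments_cm, min_cm=MIN_SEGMENT_CM):
    """Return (kept_segments, total_cM) after dropping segments below min_cm."""
    kept = [cm for cm in segments_cm if cm >= min_cm]
    return kept, round(sum(kept), 2)


# Made-up segment sizes for one match, in centimorgans.
example_segments = [22.5, 9.1, 5.8, 3.2, 1.4, 6.0]

kept, total = shared_dna_total(example_segments)
print(kept)   # only the segments of 6 cM or more
print(total)  # shared DNA total counted from the kept segments
```

In this made-up example, three of the six segments fall below the threshold, so the match's Shared DNA total shrinks even though the underlying DNA hasn't changed.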

As you can see, almost every aspect of matching has changed, so everyone was rematched against the entire database. You’ll see new results. Some matches may be gone, especially distant matches or if you’re a member of an endogamous population.

You’ll likely have new matches due to older transfer kits being imputed to full compatibility. Your matches should be more accurate too, which makes everyone happy.

I understand a white paper is being written that will provide more information about the new matching algorithms.

Ok, now let’s check out the new Family Finder Matches page.

Family Finder Matches

FamilyTreeDNA didn’t just rearrange the furniture – there’s a LOT of new content.

First, a note. You’ll see “Family Finder” in some places, and “Autosomal DNA” in other places. That’s one and the same at FamilyTreeDNA. The Family Finder test is their autosomal test, named separately because they also have Y DNA and mitochondrial DNA tests.

When you click on Family Finder matches for the first time, you will assuredly notice one thing and will probably notice a second.

First, you’ll see a little tour that explains how to use the various new tools.

Secondly, you will probably see the “Generating Matches” notice for a few seconds to a few minutes while your match list is generated, especially if the site is busy because lots of people are signing on. I saw this message for maybe a minute or two before my match list filled.

This should be a slight delay, but with so many people signing in right now, my second kit took longer. If you receive a message that says you have no matches, just refresh your page. If you had matches before, you DO have matches now.

While working with the new interface this morning, I’ve found that refreshing the screen is the key to solving issues.

My kits that have a few thousand matches loaded Family Matching (bucketing) immediately, but this (Jewish) kit that has around 30,000 matches received this informational message instead. FamilyTreeDNA has removed the little spinning icon. If you mouse over the information, you’ll see the following message:

This isn’t a time estimate. Everyone receives the same message. The message didn’t even last long enough for me to get a screenshot on the first kit that received this message. The results completed within a minute or so. The Family Matching buckets will load as soon as the parental matching is ready.

These delays should only happen the first time, or if someone has a lot of matches that they haven’t yet viewed. Once you’ve signed in, your matches are cached, a technique that improves performance, so the loading should be speedy, or at least speedier, during the second and subsequent visits.

Of course, right now, all customers have an updated match list, so there’s something new for everyone.

Getting Help

Want to see that tutorial again?

Click on that little Help box in the upper right-hand corner. You can view the Tutorial, look at Quick References that explain what’s on this page, visit the Help Center or Submit Feedback.

Two Family Finder Matches Views – Detail and Table

The first thing you’ll notice is that there are two views – Detail View and Table View. The default is Detail View.

Take a minute to get used to the new page.

Detail View – Filter Matches by Match Type

I was pleased to see new filter buttons, located in several places on the page.

The Matches filter at left allows you to display only specific relationship levels, including X-Matches which can be important in narrowing matches to a specific subset of ancestors.

You can display only matches that fall within certain relationship ranges. Note the new “Remote Relative” that was previously called speculative.

Parental Matching and Filtering by Test Type or Trees

All of your matches are displayed by default, of course, but you can click on Paternal, Maternal, or Both, as before, to view only matches in those buckets. In order for the Family Matching bucketing feature to be enabled, you must attach known relatives’ DNA matches to their proper place in your tree.

Please note that I needed to refresh the page a couple of times to get my parental matches to load the first time. I refreshed a couple of times to be sure that all of my bucketed matches loaded. This should be a first-time loading blip.

There’s a new filter button to the right of the bucketing tabs.

You can now filter by who has trees and who has taken which kinds of tests.

You can apply multiple filters at the same time to further narrow your matches.

Important – Clearing Filters

It’s easy to forget you have a filter enabled. This section is important, in part because Clear Filter is difficult to find.

The clear filter button does NOT appear until you’ve selected a filter. However, after applying that filter, to clear it and RESET THE MATCHES to unfiltered, you need to click on the “Clear Filter” button which is located at the top of the filter selections, and then click “Apply” at the bottom of the menu. I looked for “clear filter” forever before finding it here.

You’re welcome😊

Enhanced Search

Thank goodness, the search functionality has been enhanced and simplified too. Full name search works, both here and on the Y DNA search page.

If you type in a surname without selecting any search filters, you’ll receive a list of anyone with that word in their name, or in their list of ancestral surnames. This does NOT include surnames in their tree if they have not added those surnames to their list of ancestral surnames.
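
Here is a small Python sketch of the search behavior as I understand it from using the page. The records and field names are hypothetical, and the real search is surely more involved. The point is simply that the name and the ancestral surname list are checked, while surnames that appear only in a tree are not.

```python
# Hypothetical match records; the field names are my own, not FamilyTreeDNA's.
matches = [
    {"name": "John Estes", "ancestral_surnames": ["Estes", "Moore"], "tree_surnames": ["Estes"]},
    {"name": "Mary Smith", "ancestral_surnames": ["Vannoy", "Estes"], "tree_surnames": []},
    {"name": "Ann Jones", "ancestral_surnames": [], "tree_surnames": ["Estes"]},
]


def surname_search(match_list, term):
    """Return matches whose name or ancestral surname list contains the term."""
    needle = term.lower()
    found = []
    for match in match_list:
        in_name = needle in match["name"].lower()
        in_surnames = any(needle in s.lower() for s in match["ancestral_surnames"])
        # tree_surnames is deliberately never checked, mirroring the note above.
        if in_name or in_surnames:
            found.append(match)
    return found


results = surname_search(matches, "estes")
print([m["name"] for m in results])  # Ann Jones is missing: Estes is only in her tree
```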

Notice that your number of total matches and bucketed people will change based on the results of this search and any filters you have applied.

I entered Estes in the search box, with no filters. You can see that I have a total of 46 matches that contain Estes in one way or another, and how they are bucketed.

Estes is my birth surname. I noticed that three people with Estes in their information are bucketed maternally. This is the perfect example of why you can’t assume a genetic relationship based on only a surname. Those three people’s DNA matches me on my mother’s side. And yes, I confirmed that they matched my mother too on that same segment or segments.

Search Filters

You can also filter by haplogroup. This is very specific. If you select mitochondrial haplogroup J, you will only receive Family Finder matches that have haplogroup J, NOT J1 or J1c or J plus anything.

If you’re looking for your own haplogroup, you’ll need to type your full haplogroup in the search box and select mtDNA Haplogroup in the search filter dropdown.
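
The difference between an exact haplogroup match and a "J plus anything" match is easy to see in a few lines of Python. The match records below are invented, and the prefix version is only a naive contrast for illustration, not something the site offers.

```python
# Hypothetical matches with made-up mitochondrial haplogroups.
matches = [
    {"name": "Match A", "mt_haplogroup": "J"},
    {"name": "Match B", "mt_haplogroup": "J1"},
    {"name": "Match C", "mt_haplogroup": "J1c"},
    {"name": "Match D", "mt_haplogroup": "H"},
]


def exact_haplogroup(match_list, haplogroup):
    """Keep only matches whose haplogroup is exactly the one requested."""
    return [m["name"] for m in match_list if m["mt_haplogroup"] == haplogroup]


def prefix_haplogroup(match_list, haplogroup):
    """Naive contrast: keep anything whose haplogroup name starts with the text."""
    return [m["name"] for m in match_list if m["mt_haplogroup"].startswith(haplogroup)]


print(exact_haplogroup(matches, "J"))   # how the filter behaves: J only
print(prefix_haplogroup(matches, "J"))  # what you might have expected: J, J1, J1c
```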

Resetting Search Results

To dismiss search results, click on the little X. It’s easy to forget that you have initiated a search, so I need to remember to dismiss searches after I’m finished with each one.

Export Matches

The “Export CSV” button either downloads your entire match list, or the list of filtered matches currently selected. This is not your segment information, but a list of matches and related information such as which side they are bucketed on, if any, notes you’ve made, and more.

Your segment information is available for download on the chromosome browser.

Sort By

The Sort By button facilitates sorting your matches versus filtering your matches. Filters ONLY display the items requested, while sorts display all of the items requested, sorting them in a particular manner.

You can sort in any number of ways. The default is Relationship Range followed by Shared DNA.
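
If the distinction between filtering and sorting feels fuzzy, this short Python sketch may help. The match list and field names are hypothetical. A filter returns fewer rows, while a sort returns every row in a different order.

```python
# Hypothetical match list; the field names are invented for this example.
matches = [
    {"name": "Match A", "shared_cm": 48.0, "has_tree": True},
    {"name": "Match B", "shared_cm": 212.5, "has_tree": False},
    {"name": "Match C", "shared_cm": 97.3, "has_tree": True},
]

# Filter: only the matches that meet the condition are displayed.
filtered = [m for m in matches if m["has_tree"]]

# Sort: every match is still displayed, just reordered (largest shared DNA first).
sorted_matches = sorted(matches, key=lambda m: m["shared_cm"], reverse=True)

print([m["name"] for m in filtered])        # two of the three matches
print([m["name"] for m in sorted_matches])  # all three matches, reordered
```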

Your Matches – Detail View

A lot has changed, but after you get used to the new interface, it makes more sense and there are a lot more options available which means increased flexibility. Remember, you can click to enlarge any of these images.

To begin with, you can see the haplogroups of your matches if they have taken a Y or mitochondrial DNA test. If you match someone, you’ll see a little check in the haplogroup box. I’m not clear whether this means you’re a haplogroup match or that person is on your match list.

To select people to compare in the chromosome browser, you simply check the little square box to the left of their photo and the chromosome browser box pops up at the bottom of the page. We’ll review the chromosome browser in a minute.

The new Relationship Range prediction is displayed, based on new calculations with segments below 6 cM removed. The linked relationship is displayed below the range.

A linked relationship occurs when you link that person to their proper place in your tree. If you have no linked relationship, you’ll see a link to “assign relationship” which takes you to your tree to link this person if you know how you are related.

The segments below 6 cM are gone from the Shared DNA total and X matches are only shown if they are 6 cM or above.

In Common With and Not In Common With

In Common With and Not In Common With is the little two-person icon at the right.

Just click on the little person icon, then select “In Common With” to view your shared matches between you, that match, and other people. The person you are viewing matches in common with is highlighted at the top of the page, with your common matches below.

You can stack filters now. In this example, I selected my cousin, Don, to see our common matches. I added the search filter of the surname Ferverda, my mother’s maiden name. She is deceased and I manage her kit. You can see that my cousin Don and I have 5 total common matches – four maternal and one both, meaning one person matches me on both my maternal and paternal lines.
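
Conceptually, In Common With and Not In Common With are set operations on two match lists, and a stacked search filter narrows the result further. Here is a minimal Python sketch. Every name and surname list below is invented, and this is only my mental model, not how FamilyTreeDNA implements the feature.

```python
# Hypothetical match lists; every match name here is invented for illustration.
my_matches = {"Cousin", "Match A", "Match B", "Match C", "Match D"}
cousin_matches = {"Match A", "Match B", "Match E"}

# Hypothetical ancestral surname lists for my matches.
surnames = {
    "Match A": ["Ferverda", "Miller"],
    "Match B": ["Lore"],
    "Match C": ["Ferverda"],
    "Match D": [],
}

# In Common With: people who match both me and my cousin.
in_common = my_matches & cousin_matches

# Not In Common With: my matches who do not match my cousin.
not_in_common = my_matches - cousin_matches - {"Cousin"}

# Stacked filter: common matches who also list the searched surname.
stacked = {name for name in in_common if "Ferverda" in surnames.get(name, [])}

print(sorted(in_common))
print(sorted(not_in_common))
print(sorted(stacked))
```

Notice that Match C lists the surname but isn't a common match, so it drops out once the two filters are stacked.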

It’s great news that now Cousin Don pops up in the chromosome browser box at the bottom, enabling easy confusion-free chromosome segment comparisons directly from the In Common With match page. I love this!!!

All I have to do now is click on other people and then on Compare Relationship which pushes these matches through to the chromosome browser. This is SOOOO convenient.

You’ll see a new tree icon at right on each match. A dark tree means there’s content and a light tree means this person does not have a tree. Remember, you can filter by trees with content using the filter button beside “Both”.

Your notes are shown at far right. Any person with a note is dark grey and no note is white.

If you’re looking for the email contact information, click on your match’s name to view their placard which also includes more detailed ancestral surname information.

Family Finder – Table View

The table view is very similar to the Detail View. The layout is a bit different with more matches visible in the same space.

This view has lots of tooltips on the column heading bar! Tooltips are great for everyone, but especially for people just beginning to find their way in the genetic genealogy world.

I’ll have to experiment a bit to figure out which view I prefer. I’d like to be able to set whichever view I prefer as my default. In fact, I think I’ll submit that in the “Submit Feedback” link. For every suggestion, I’m going to find something really positive to say. This was an immense overhaul.

Chromosome Browser

Let’s look at the Chromosome Browser.

You can arrive at the Chromosome Browser by selecting people on your match page, or by selecting the Chromosome Browser under the Results and Tools link.

Everything is pretty much the same on the chromosome browser, except the default view is now 6 cM and the smaller segments are gone. You can also choose to view only segments above 10 cM.

If you have people selected in the chromosome browser and click on Download Segments in the upper right-hand corner, it downloads the segments of only the people currently selected.

You can “Clear All” and then click on Download All Segments which downloads your entire segment file. To download all segments, you need to have no people selected for comparison.

The contents of this file are greatly reduced as it now contains only the segments 6 cM and above.

Family Tree

No, the family tree has not changed, and yes, it needs to, desperately. Trust me, the management team is aware and I suspect one of the improvements, hopefully sooner rather than later, will be an improved tree experience.

Y DNA

The Y DNA page has received an update too, adding both a Detail View and a Table View with the same basic functionality as the Family Finder matching above. If you are reading this article for Y DNA only, please read the Family Finder section to understand the new layout and features.

As before, the match comparison begins at the 111 marker level.

However, there’s a BIG difference. If there are no matches at this level, YOU NEED TO CLICK THE NEXT TAB. You can easily see that this person has matches at the 67 level and below, but the system no longer “counts down” through the various levels until it either finds a level with a match or reaches 12 markers.

If you’re used to the old interface, it’s easy to think you’re at the final destination of 12 markers with no matches when you’re still at 111.
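
For anyone who likes to see the logic spelled out, here is a rough Python sketch of the old "count down" behavior compared with the new one. The match counts are made up, the function name is mine, and I'm assuming the usual 111, 67, 37, 25, and 12 marker tabs. It only illustrates the difference described above and is not FamilyTreeDNA's code.

```python
# Marker levels from highest to lowest; assumed to be the standard tabs.
MARKER_LEVELS = [111, 67, 37, 25, 12]


def first_level_with_matches(match_counts):
    """Old behavior: step down until a level has matches or 12 markers is reached."""
    for level in MARKER_LEVELS:
        if match_counts.get(level, 0) > 0:
            return level
    return MARKER_LEVELS[-1]


# Made-up match counts for a tester with no matches at 111 markers.
match_counts = {111: 0, 67: 3, 37: 12, 25: 40, 12: 250}

old_landing = first_level_with_matches(match_counts)
new_landing = MARKER_LEVELS[0]  # new behavior: the page stays on the 111 tab

print(old_landing)  # the level the old interface would have shown
print(new_landing)  # the level the new interface shows until you click another tab
```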

Y DNA Detail View

The Y-DNA Detail and Table view features are the same as Family Finder and are described in that section.

The new format is quite different. One improvement is that the Paternal Country of Origin is now displayed, along with a flag. How cool is that!

The Paternal Earliest Known Ancestor and Match Date are at far right. Note that match dates have been reset to the rerun date. At this point, FamilyTreeDNA is evaluating the possibility of restoring the original match date. Regardless, you’ll be able to filter for match dates when new matches arrive.

Please check to be sure you have your Country of Origin, Earliest Known Ancestor, and mapped location completed and up to date.

Earliest Known Ancestor

If you haven’t completed your Earliest Known Ancestor (EKA) information, now’s the perfect time. It’s easy, so let’s do it before you forget.

Click on the Account Settings gear beneath your name in the right-hand upper corner. Click on Genealogy, then on Earliest Known Ancestors and complete the information in the red boxes.

Direct paternal line means your father’s father’s father’s line – as far up through all fathers as you can reach. This is your Y DNA lineage, but females should complete this information on general principles.
Direct maternal line means your mother’s mother’s mother’s line – as far up through all mothers as you can reach. This is your mitochondrial DNA lineage, so relevant for both males and females.

Completing all of the information, including the location, will help you and your matches as well when using the Matches Map.

Be sure to click Save when you’re finished.

Y DNA Filters

Y DNA has more filter options than autosomal.

The Y DNA filter, located to the right of the 12 Markers tab, allows testers to filter by:

Genetic distance, meaning the number of mutations that differ between you and your matches
Groups, meaning group projects that the tester has joined
Tree status
Match date
Level of test taken

If none of your matches have taken the 111 marker test or you don’t match anyone at that level, that test won’t show up on your list.

Y DNA Table View

As with Family Finder, the Table View is more condensed and additional features are available on the right side of each match. For details, please review the Family Finder section.

If you’re looking for the old Y DNA TiP report, it’s now at the far right of each match.

The actual calculator hasn’t changed yet. I know people were hoping for the new Y DNA aging in this release, but that’s yet to follow.

Other Pages

Other pages like the Big Y and Mitochondrial DNA did not receive new features or functionality in this release, but do sport new user-friendly tooltips.

I lost track, but I counted over 100 tooltips added across the platform, and this is just the beginning.

There are probably more new features and functionality that I haven’t stumbled across just yet.

And yes, we are going to find a few bugs. That’s inevitable with something this large. Please report anything you find to FamilyTreeDNA.

Oh wait – I almost forgot…

New Videos

I understand that there are in the ballpark of 50 new videos that are being added to the new Help Center, either today or very shortly.

When I find out more, I’ll write an article about what videos are available and where to find them. People learn in various ways. Videos are often requested and will be a popular addition. I considered making videos, but that’s almost impossible for anyone besides the vendor because the names on screens either need to be “fake” or the screen needs to be blurred.

So hurray – very glad to hear these are imminent!

Stay Tuned

Stay tuned for new developments. As Lior said, FamilyTreeDNA is investing heavily in genetic genealogy and there’s more to come.

My Mom used to say that the “proof is in the pudding.” I’d say the myDNA/FamilyTreeDNA leadership team has passed this initial test with flying colors.

Of course, there’s more to do, but I’m definitely grateful for this lovely pudding. Thank you – thank you!

I can’t wait to get started and see what new gems await.

Take a Look!

Do you have more matches?

Are your matches more accurate?

How about predicted relationships?

How has this new release affected you?

What do you like the best?

_____________________________________________________________

Disclosure

Thank you so much.

DNA Purchases and Free Transfers

FamilyTreeDNA – Y, mitochondrial and autosomal DNA testing
MyHeritage DNA – Autosomal DNA test
MyHeritage FREE DNA file upload – Transfer your results from other vendors free
AncestryDNA – Autosomal DNA test
23andMe Ancestry – Autosomal DNA only, no Health
23andMe Ancestry Plus Health

Genealogy Products and Services

MyHeritage FREE Tree Builder – Genealogy software for your computer
MyHeritage Subscription with Free Trial
Legacy Family Tree Webinars – Genealogy and DNA classes, subscription-based, some free
Legacy Family Tree Software – Genealogy software for your computer
Charting Companion – Charts and Reports to use with your genealogy software or FamilySearch
RootsMagic Software – Genealogy software for your computer
Newspapers.com – Search newspapers for your ancestors

Books

Genealogical.com – Lots of wonderful genealogy research books

Genealogy Research

Legacy Tree Genealogists – Professional genealogy research

Triangulation in Action at Family Tree DNA

Posted on November 6, 2019 by Roberta Estes

Recently, I published the article, Hitting a Genealogy Home Run Using Your Double-Sided Two-Faced Chromosomes While Avoiding Imposters. The “Home Run” article explains why you want to use a chromosome browser, what you’re seeing and what it means to you.

This article, and the rest in the “Triangulation in Action” series, introduce triangulation at Family Tree DNA, MyHeritage, 23andMe, GedMatch and DNAPainter, explaining how to use triangulation to confirm descent from a common ancestor. You may want to read the introductory article first.

What is Triangulation?

Think of triangulation as a three-legged stool – a triangle. Triangulation requires three things:

At least three (not closely related) people must match
On the same reasonably sized segment of DNA and
Descend from a common ancestor

Triangulation is the foundation of confirming descent from a common ancestor, and thereby assigning a specific segment to that ancestor. Without triangulation, you might just have a match to someone else by chance. You can confirm mathematical triangulation, numbers 1 and 2, above, without knowing the identity of the common ancestor.

Boundaries

Triangulation means that all three, or more, people must match on a common segment. However, what you’re likely to see is that some people don’t match on the entire segment, meaning more or less than others, as demonstrated in the following examples.

FTDNA Triangulation boundaries.png

You can see that I match 5 different cousins who I know descend from my father’s side on chromosome 15 above. As always, I’m the background grey and these matches are all being compared against me.

I triangulate with them in different ways, forming multiple triangulation groups that I’ve discussed individually, below.

Triangulation Group 1

FTDNA triangulation 1.png

Group 1 – On the left group of matches, above, I triangulate with the blue, red and orange person on the amount of DNA that is common between all of them, shown in the black box. This is triangulation group 1.

I’ve overlaid additional triangulation groups below, so you can compare the groups.

Triangulation Group 2

FTDNA triangulation 2.png

Group 2 – However, if you look just at the blue and orange triangulated matches bracketed in green, I triangulate on slightly more, extending to the left. This group excludes the red person because their beginning point is not the same, or even close. This is triangulation group 2.

Triangulation Group 3 and 4

FTDNA triang 3.png

Group 3 – At right, we see two large triangulation groups. Triangulation group 3 includes the common portions of blue, red, teal and orange matches.

Group 4 – Triangulation group 4 is the skinny group at far right and includes the common portion of the blue, teal and dark blue matches.

Triangulation Groups 5 and 6

FTDNA triang 5.png

Group 5 – There are also two more triangulation groups. The larger green bracketed group includes only the blue and teal people because their end locations are to the right of the end locations of the red and orange matches. The start location varies as well. This is triangulation group 5.

Group 6 – The smaller green bracketed group includes only the blue and teal people because their start locations are before the dark blue person. This is triangulation group 6.

There’s actually one more triangulation group. Can you spot it?

Triangulation Group 7

FTDNA triang 7.png

Group 7 – The tan group includes the red, teal and orange matches but only the areas where they all overlap. This excludes the top blue match because their start location is different. Triangulation group 7 only extends to the end of the red and orange matches, because those are the same locations, while the teal match extends further to the right. That extension is excluded in this group, of course.

Slight Variations

Matches with only slight start and end differences are probably descended from the same ancestor, but we can’t say that for sure (at this point) so we only include actual mathematically matching segments in a triangulation group.

You can see that triangulation groups often overlap because group members share more or less DNA with each other. Normally we don’t bother to number the groups – we just look at the alignment. I numbered them for illustration purposes.
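For readers who like to see the logic spelled out, the group boundaries above can be sketched in a few lines of Python. This is only an illustration. The positions are made up, and the `Segment` record and `common_overlap` function are hypothetical names, not anything a vendor provides. The sketch finds the region that every selected match shares with me, which corresponds to the boxed area in each example. Keep in mind that an overlap on the browser is only part of the picture, because the matches still need to match each other on that segment.

```python
from typing import NamedTuple, Optional


class Segment(NamedTuple):
    """One match's shared segment with the tester on a single chromosome."""
    match_name: str
    chromosome: str
    start: int  # start location (base pairs)
    end: int    # end location (base pairs)


def common_overlap(segments: list[Segment]) -> Optional[tuple[int, int]]:
    """Return the (start, end) region shared by ALL segments, or None."""
    if not segments:
        return None
    if len({s.chromosome for s in segments}) != 1:
        return None  # segments on different chromosomes cannot overlap
    start = max(s.start for s in segments)  # latest start location
    end = min(s.end for s in segments)      # earliest end location
    return (start, end) if start < end else None


# Invented positions, loosely modeled on the chromosome 15 illustration
blue = Segment("blue", "15", 20_000_000, 60_000_000)
red = Segment("red", "15", 28_000_000, 55_000_000)
orange = Segment("orange", "15", 21_000_000, 55_000_000)

# All three people: the overlap starts where the red match starts
print(common_overlap([blue, red, orange]))  # (28000000, 55000000)

# Leaving out red extends the shared region to the left
print(common_overlap([blue, orange]))       # (21000000, 55000000)
```

Dropping the red match from the list lengthens the shared region, which is the same effect seen when comparing group 1 to group 2.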

Shared or In-Common-With Matching

Triangulation is not the same thing as a 3-way shared “in-common-with” match. You may share DNA with those two people, but on entirely different segments from entirely different ancestors. If those other two people match each other, it can be on a segment where you don’t match either of them, and thanks to an ancestor that they share who isn’t in your line at all. Shared matches are a great hint, especially in addition to other information such as Phased Family Matching which we’ll talk about in a minute, but shared matches don’t necessarily mean triangulation has occurred, although it’s a great place to start looking.

I have shared matches where I match one person on my maternal side, one on my paternal side, and they match each other through a completely different ancestor on an entirely different segment. However, we don’t triangulate because we don’t all match each other on the SAME segment of DNA. Yes, it can be confusing.

Just remember, each of your segments, and matches, has its own individual history.

Imputation Can Affect Matching

Over the years the chips on which our DNA is processed at the vendors have changed. Each new generation of chips tests a different number of markers, and sometimes different markers – with the overlaps between the entire suite of chips being less than optimal.

I can verify that most vendors use imputation to level the playing field, and even though two vendors have never verified that fact, I’m relatively certain that they all do. That’s the only way they could match to their own prior “only somewhat compatible” chip versions.

The net-net of this is that you may see some differences in matching segments at different vendors, even when you’re comparing the same people. Imputation generally “fills in the blanks,” but doesn’t create large swaths of non-existent DNA. I wrote about the concept of imputation here.

What I’d like for you to take away from this discussion is to focus on the big picture – if and how people triangulate, which is the function important to genealogy, not whether the start and end locations are exactly the same.

Triangulation Solutions

Each of the major vendors, except Ancestry, which does not have a chromosome browser, offers some type of triangulation solution, so let’s look at what each vendor offers. If your Ancestry matches have uploaded to GedMatch, Family Tree DNA or MyHeritage, you can triangulate with them there. Otherwise, you can’t triangulate Ancestry results, so encourage your Ancestry matches to transfer.

You can find step-by-step transfer instructions to and from each vendor, here.

I wrote more specifically about triangulation here and here.

Let’s start by looking at triangulation at Family Tree DNA.

Triangulation at Family Tree DNA

Family Tree DNA has two different tools that can be used separately in different circumstances to determine whether or not your segments triangulate.

Phased Family Matching can be used for triangulation.

The Matrix tool can be utilized for people who aren’t designated through Phased Family Matching as maternal or paternal matches to suggest or eliminate triangulation.

First, go to the Family Finder section of your personal page.

We’ll be working with Matches, the Chromosome Browser, and the Matrix.

FTDNA triangulation page.png

Phased Family Matching

At Family Tree DNA, I’ve tested my cousins:

Cheryl, my mother’s first cousin (1C)
Charlene, my first cousin once removed (1C1R) on my father’s side
David, my second cousin (2C) on my father’s side.

I’ve linked the test results of those cousins to my tree in their proper location, which allows Family Tree DNA to do something called Phased Family Matching.

If you don’t have a tree and don’t link your DNA results and those of your family members, Family Tree DNA can’t perform Phased Family Matching.

I explained phasing in the introductory article.

Testing your parents is wonderful if that’s possible, but parents aren’t always available to test. At Family Tree DNA, you don’t need to have tested your parents in order to have phased matches.

In essence, Family Tree DNA uses the DNA of known cousins, third cousins or closer, to assign matches to maternal or paternal tabs, or sides, also sometimes referred to as buckets. I wrote about Phased Family Matching here and here.

You can see that of my 4806 matches, 1101 are assigned to my paternal side, 884 to my maternal side and 4 are assigned to both.

My cousin Charlene is assigned to my paternal side, as shown by the blue icon, because I linked her to the correct position in my tree, as is my cousin, David, below.

Conversely, my cousin Cheryl is assigned maternally because I linked her as well.

These specific people are assigned maternally and paternally because I linked them to their proper place in my tree. These matches allow Family Tree DNA to link other testers to the proper side of my tree too, because they match me and my cousin on the same segments – in essence phasing a large number of my matches for me, which facilitates triangulation.

Linking Matches on Your Tree

In order to cause Phased Family Matching, aka “bucketing,” to occur, I linked my own test and that of my known 3rd cousins or closer to their proper places in my tree at Family Tree DNA.

If you don’t create a tree or upload a GEDCOM file and link yourself and your known matches, your matches can’t be assigned to maternal and paternal sides.

FTDNA triang tree.png

By utilizing the matching DNA between you and known close relatives on your maternal and paternal sides, Family Tree DNA assigns other people who match both of you on those same segments to the same side of your tree.

If you select matches from the same side of your tree and they match on the same segments, they triangulate.

Of course, that’s assuming the person doesn’t match you on both sides of your tree.

You can also download your matching segments in a file and sort to see who matches on the same locations, but the parental side designation (bucketing) is not reflected in the segment download file. Bucketing is reflected in the match download file which is a different file.

There are two separate download files, but they can be merged.

Two Download Files

The first file, your match download file, provides information about your matches such as their haplogroups, surnames and contact information, including bucketing assignment, but not the actual matching segment data.

The match file tells you a great deal and is both sortable and searchable. You can search for any surname, for example, or you can sort for everyone in the Paternal or Maternal matching bucket. You can creatively combine parts of this file with the matching segments file in order to quickly flag the people on your paternal side. Knowledge about how to work with spreadsheets is a plus.

This download is available at the bottom of the Family Finder match page.

FTDNA triang match.png

You can download all of your matches, or just those in a filtered view, such as in-common-with or as the result of a surname search.

FTDNA triang download.png

The second file, your matching segments file, is available on the chromosome browser page.

The matching segments file includes the match name along with the matching chromosome segments and number of matching SNPs.

FTDNA triang segment file.png

If you click through to the chromosome browser from your main page, as shown below, with NO MATCHES SELECTED, you will be able to download ALL matching segments.

FTDNA triang browser.png

You’ll see “Download All Segments” in the upper right-hand corner.

From that Chromosome Browser page, you will also have the ability to select matches to show on the browser.

FTDNA triang browser select

If you select people on the match page before clicking on the chromosome browser, or select matches on the chromosome browser page, then clicking on “Download Segments” will only download the matching segments of the people that you have currently selected to match against in the browser.
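If you’re comfortable with a bit of scripting, combining the two downloads can be sketched in Python instead of a spreadsheet. This is a minimal sketch under stated assumptions. The column names (`Full Name`, `Matching Bucket`, `Match Name`, `Start Location` and so on) are assumptions, so check the header row of your own files. The people and positions are invented, and `bucket_by_name` and `segments_with_bucket` are hypothetical helper names. It looks up each person’s bucket from the match file and adds it to that person’s rows in the segment file.

```python
import csv
import io

# Tiny stand-ins for the two downloads. Column names are assumptions;
# check the header row of your own files before relying on them.
MATCH_FILE = """Full Name,Matching Bucket
Charlene Example,Paternal
David Example,Paternal
Cheryl Example,Maternal
Pat Unknown,N/A
"""

SEGMENT_FILE = """Match Name,Chromosome,Start Location,End Location,Centimorgans
Charlene Example,15,20000000,60000000,35.2
David Example,15,28000000,55000000,24.1
Cheryl Example,15,25000000,58000000,29.7
Pat Unknown,3,1000000,9000000,8.4
"""


def bucket_by_name(match_csv: str) -> dict[str, str]:
    """Map each match's name to their bucket from the match file."""
    reader = csv.DictReader(io.StringIO(match_csv))
    return {row["Full Name"]: row["Matching Bucket"] for row in reader}


def segments_with_bucket(segment_csv: str, buckets: dict[str, str]) -> list[dict]:
    """Add a 'Bucket' column to every row of the segment file."""
    rows = []
    for row in csv.DictReader(io.StringIO(segment_csv)):
        row["Bucket"] = buckets.get(row["Match Name"], "Unknown")
        rows.append(row)
    # Sort by chromosome label (as text), then start location, so that
    # segments in the same region sit next to each other.
    rows.sort(key=lambda r: (r["Chromosome"], int(r["Start Location"])))
    return rows


merged = segments_with_bucket(SEGMENT_FILE, bucket_by_name(MATCH_FILE))
paternal = [r["Match Name"] for r in merged if r["Bucket"] == "Paternal"]
print(paternal)  # ['Charlene Example', 'David Example']
```

With real downloads, you would read the files from disk with `open()` instead of the embedded text. Two different matches can share the same name, so spot check the merged result.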

Combinations of Tools and Filters

The chromosome browser tells you if people match you on the same segment.
The in-common-with filter on the match page tells you who you match in common with a specific person, but not if those two people match each other.

Of course, if both people are assigned to your same parental side bucket, and they both only match you on one large segment – and it’s the same segment, then you must triangulate.

If they aren’t both assigned to a parental bucket, then you can’t make that determination using parental side designations.

Is there a tool that allows you to compare people against each other at the same time to see if your matches also match each other?

Glad you asked.

Yes, there is.

The Matrix

Let’s say that you want to see if a group of people who you match also match each other.

FTDNA triang matrix.png

Family Tree DNA provides a Matrix tool that allows you to select 10 (or fewer) matches in order to determine if your matches also match each other.

FTDNA triang matrix match.png

I’ve entered Cheryl, Charlene and David. You can see that David and Charlene match each other, and Cheryl doesn’t match either Charlene or David.

Of course, we know that’s accurate because:

I already know these people and their relationship to me and each other
These three people are already assigned to maternal and paternal sides or buckets, so the matrix is verifying what we already know
I know where they match on the same segment on the chromosome browser

FTDNA triang 3 browser.png

Even though they match on the same segment on the chromosome browser, the fact that they are bucketed to different parental sides, and that the matrix shows that Cheryl doesn’t match either Charlene or David, confirms that David and Charlene triangulate with me, while Cheryl is not a member of that triangulation group.

This is exactly why triangulation is important. Looking at the image above, the only thing you know is that all 3 match you – but with the additional information about bucketing and the matrix, we know that only the two bottom people, Charlene and David, triangulate with me. Note that I’ve added the maternal and paternal icons for clarity.

FTDNA triang match group browser.png

However, if I didn’t have this knowledge, or not everyone was bucketed, the Matrix tool would be extremely useful. The matrix tool uses the matching threshold of approximately 7.69 cM.

The matrix doesn’t tell you if these people match each other on the same segment where they match you. However, there’s a good probability that they do, especially if only one matching segment is involved.

You can check the chromosome browser to see if they both match you on the same segment. It’s possible if they don’t match you on the same segment that they match each other on different segments, and possibly through a different ancestor. You may need to reach out to them to ask if they match each other, and if they have known genealogy if they aren’t bucketed.

By utilizing the Matrix tool, you can isolate people to maternal and paternal sides of your tree.
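The reasoning in this section, combining the chromosome browser with the Matrix, can be sketched in Python. This is a hedged illustration only. The positions are invented, the data structures and function names are hypothetical, and Family Tree DNA does not offer this as a download or API. A pair is flagged when both people overlap on the same region of my chromosome and the Matrix shows that they also match each other. Because the Matrix doesn’t say which segment they share with each other, the result is a list of candidates, not proof.

```python
from itertools import combinations

# Segments each match shares with the tester: name -> (chromosome, start, end).
# Positions are invented for illustration.
segments_with_me = {
    "Cheryl": ("15", 25_000_000, 58_000_000),
    "Charlene": ("15", 20_000_000, 60_000_000),
    "David": ("15", 28_000_000, 55_000_000),
}

# What the Matrix reports: pairs of matches who also match each other.
matrix_pairs = {frozenset({"Charlene", "David"})}


def overlap_with_me(a: str, b: str) -> bool:
    """True if a and b share a region of their segments with the tester."""
    chr_a, start_a, end_a = segments_with_me[a]
    chr_b, start_b, end_b = segments_with_me[b]
    return chr_a == chr_b and max(start_a, start_b) < min(end_a, end_b)


def candidate_triangulations(names) -> list[tuple[str, str]]:
    """Pairs that overlap on the tester's browser AND match each other."""
    return [
        (a, b)
        for a, b in combinations(sorted(names), 2)
        if overlap_with_me(a, b) and frozenset({a, b}) in matrix_pairs
    ]


print(candidate_triangulations(segments_with_me))  # [('Charlene', 'David')]
```

All three people overlap on the browser, but only Charlene and David also match each other, so Cheryl drops out, just as in the example above.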

Other Resources to Identify Common Ancestors

Be sure to check other clues at Family Tree DNA such as:

Shared surnames, shown on your matches page, with common surnames that you share bolded

FTDNA triang surnames.png

Trees, indicated by the blue pedigree icon on the match page.

FTDNA triang pedigree.png

Y and mitochondrial DNA haplogroups and matching. You can view your match’s haplogroup and other information by clicking on their profile picture on your matches page.

Advanced Matching can be utilized to see if you match on combined tests, or in common projects.

FTDNA triang advanced match.png

This article discusses the 9 different autosomal tools available at Family Tree DNA.

What About You?

Do you have a tree at Family Tree DNA?

Have you connected your test and any family members to your tree?

Can you test a family member, third cousins or closer, or have them transfer a kit from another vendor?

Here’s how to transfer:

How many people do you have on your paternal and maternal tabs on your Family Finder matches page?

You can paint every single one of the people who are designated as maternal or paternal at DNAPainter to your grandparents on the respective maternal or paternal side. DNAPainter Instructions and Resources will explain how, and why.

Join me soon for similar articles about how to work with triangulation at MyHeritage, 23andMe, GedMatch and DNAPainter.

Most of all – have fun!

2018 – The Year of the Segment

Posted on January 1, 2019 by Roberta Estes

Looking in the rear view mirror, what a year! Some days it’s been hard to catch your breath because things have been moving so fast.

What were the major happenings, how did they affect genetic genealogy and what’s coming in 2019?

The SNiPPY Award

First of all, I’m giving an award this year. The SNiPPY.

Yea, I know it’s kinda hokey, but it’s my way of saying a huge thank you to someone in this field who has made a remarkable contribution and who deserves special recognition.

Who will it be this year?

Drum roll…….

The 2018 SNiPPY goes to…

DNAPainter – The 2018 SNiPPY award goes to DNAPainter, without question. Applause, everyone, applause! And congratulations to Jonny Perl, pictured below at Rootstech!

Jonny Perl created this wonderful, visual tool that allows you to paint your matches with people on your chromosomes, assigning the match to specific ancestors.

I’ve written about how to use the tool with different vendors’ results and have discovered many different ways to utilize the painted segments. The DNA Painter User Group is here on Facebook. I use DNAPainter EVERY SINGLE DAY to solve a wide variety of challenges.

What else has happened this year? A lot!

Ancient DNA – Academic research seldom reports on Y and mitochondrial DNA today and is firmly focused on sequencing ancient DNA. Ancient genome sequencing has only recently been developed to a state where at least some remains can be successfully sequenced, but it’s going great guns now. Take a look at Jennifer Raff’s article in Forbes that discusses ancient DNA findings in the Americas, Europe, Southeast Asia and perhaps most surprising, a first generation descendant of a Neanderthal and a Denisovan.

From Early human dispersals within the Americas by Moreno-Mayar et al, Science 07 Dec 2018

Inroads were made into deeper understanding of human migration in the Americas as well in the paper Early human dispersals within the Americas by Moreno-Mayar et al.

I look for 2019 and on into the future to hold many more revelations thanks to ancient DNA sequencing as well as using those sequences to assist in understanding the migration patterns of ancient people that eventually became us.

Barbara Rae-Venter and the Golden State Killer Case

Using techniques that adoptees use to identify their close relatives and eventually, their parents, Barbara Rae-Venter assisted law enforcement with identifying the man, Joseph DeAngelo, accused (not yet convicted) of being the Golden State Killer (GSK).

A very large congratulations to Barbara, a retired patent attorney who is also a genealogist. Nature recognized Ms. Rae-Venter as one of 2018’s 10 People Who Mattered in Science.

DNA in the News

DNA is also represented on the 2018 Nature list by Viviane Slon, a palaeogeneticist who discovered an ancient half Neanderthal, half Denisovan individual and sequenced their DNA, and He Jiankui, a Chinese scientist who claims to have created a gene-edited baby, which has sparked widespread controversy. As of the end of the year, He Jiankui’s research activities have been suspended and he is reportedly sequestered in his apartment, under guard, although the details are far from clear.

In 2013, 23andMe patented the technology for designer babies and I removed my kit from their research program. I was concerned at the time that this technology knife could cut two ways, both for good, eliminating fatal disease-causing mutations and also for ethically questionable practices, such as eugenics. I was told at the time that my fears were unfounded, because that “couldn’t be done.” Well, 5 years later, here we are. I expect the debate about the ethics and eventual regulation of gene-editing will rage globally for years to come.

Elizabeth Warren’s DNA was also in the news when she took a DNA test in response to political challenges. I wrote about what those results meant scientifically, here. This topic became highly volatile and politicized, with everyone seeming to have a very strongly held opinion. Regardless of where you fall on that opinion spectrum (and no, please do not post political comments as they will not be approved), the topic is likely to surface again in 2019 due to the fact that Elizabeth Warren has just today announced her intention to run for President. The good news is that DNA testing will likely be discussed, sparking curiosity in some people, perhaps encouraging them to test. The bad news is that some of the discussion may be unpleasant at best, and incorrect click-bait at worst. We’ve already had a rather unpleasant sampling of this.

Law Enforcement and Genetic Genealogy

The Golden State Killer case sparked widespread controversy about using GedMatch and potentially other genetic genealogy databases to assist in catching people who have committed violent crimes, such as rape and murder.

GedMatch, the database used for the GSK case has made it very clear in their terms and conditions that DNA matches may be used for both adoptees seeking their families and for other uses, such as law enforcement seeking matches to DNA sequenced during a criminal investigation. Since April 2018, more than 15 cold case investigations have been solved using the same technique and results at GedMatch. Initially some people removed their DNA from GedMatch, but it appears that the overwhelming sentiment, based on uploads, is that people either aren’t concerned or welcome the opportunity for their DNA matches to assist apprehending criminals.

Parabon Nanolabs in May established a genetic genealogy division headed by CeCe Moore who has worked in the adoptee community for the past several years. The division specializes in DNA testing forensic samples and then assisting law enforcement with the associated genetic genealogy.

Currently, GedMatch is the only vendor supporting the use of forensic sample matching. Neither 23andMe nor Ancestry allow uploaded data, and MyHeritage and Family Tree DNA’s terms of service currently preclude this type of use.

MyHeritage

Wow, talk about coming onto the DNA world stage with a boom.

MyHeritage went from a somewhat wobbly DNA start about 2 years ago to rolling out a chromosome browser at the end of January and adding important features such as SmartMatching, which matches your DNA and your family trees. Add triangulation to this mixture, along with record matching, and you’ve got a #1 winning combination.

It was Gilad Japhet, the MyHeritage CEO, who christened 2018 “The Year of the Segment” at Rootstech, and I do believe he was right. Additionally, he announced that MyHeritage partnered with the adoption community by offering 15,000 free kits to adoptees.

In November, MyHeritage hosted MyHeritage LIVE, their first user conference in Oslo, Norway, which focused on both their genealogical records offerings as well as DNA. This was a resounding success and I hope MyHeritage will continue to sponsor conferences and invest in DNA. You can test your DNA at MyHeritage or upload your results from other vendors (instructions here). You can follow my journey and the conference in Oslo here, here, here, here and here.

GDPR

GDPR caused a lot of misery, and I’m glad the implementation is behind us, but the ripples will be affecting everyone for years to come.

GDPR, the European General Data Protection Regulation, which went into effect on May 25, 2018, has been a mixed and confusing bag for genetic genealogy. I think the concept of users being in charge and understanding what is happening with their data, and in this case, their data plus their DNA, is absolutely sound. The requirements, however, were created without any consideration to this industry – which is small by comparison to the Googles and Facebooks of the world. However, the Googles and Facebooks of the world along with many larger vendors seem to have skated, at least somewhat.

Other companies shut their doors or restricted their offerings in other ways, such as World Families Network and Oxford Ancestors. Vendors such as Ancestry and Family Tree DNA had to make unpopular changes in how their users interface with their software – in essence making genetic genealogy more difficult without any corresponding positive return. The potential fines, 20 million plus Euro for any company holding data for EU residents, made it unwise to ignore the mandates.

In the genetic genealogy space, the shuttering of both YSearch and MitoSearch was heartbreaking, because that was the only location where you could actually compare Y STR and mitochondrial HVR1/2 results. Not everyone uploaded their results, and the sites had not been updated in a number of years, but the closure due to GDPR was still a community loss.

Today, mitoydna.org, a nonprofit comprised of genetic genealogists, is making strides in replacing that lost functionality, plus, hopefully more.

On to more positive events.

Family Tree DNA

In April, Family Tree DNA announced a new version of the Big Y test, the Big Y-500 in which at least 389 additional STR markers are included with the Big Y test, for free. If you’re lucky, you’ll receive between 389 and 439 new markers, depending on how many STR markers above 111 have quality reads. All customers are guaranteed a minimum of 500 STR markers in total. Matching was implemented in December.

These additional STR markers allow genealogists to assemble additional line marker mutations to more granularly identify specific male lineages. In other words, maybe I can finally figure out a line marker mutation that will differentiate my ancestor’s line from other sons of my founding ancestor😊

In June, Family Tree DNA announced that they had named more than 100,000 SNPs which means many haplogroup additions to the Y tree. Then, in September, Family Tree DNA published their Y haplotree, with locations, publicly for all to reference.

I was very pleased to see this development, because Family Tree DNA clearly has the largest Y database in the industry, by far, and now everyone can reap the benefits.

In October, Family Tree DNA published their mitochondrial tree publicly as well, with corresponding haplogroup locations. It’s nice that Family Tree DNA continues to be the science company.

You can test your Y DNA, mitochondrial or autosomal (Family Finder) at Family Tree DNA. They are the only vendor offering full Y and mitochondrial services complete with matching.

2018 Conferences

Of course, there are always the national conferences we’re familiar with, but more and more, online conferences are becoming available, as well as some sessions from the more traditional conferences.

I attended Rootstech in Salt Lake City in February (brrrr), which was lots of fun because I got to meet and visit with so many people including Mags Gaulden, above, who is a WikiTree volunteer and writes at Grandma’s Genes, but as a relatively expensive conference to attend, Rootstech was pretty miserable. Rootstech has reportedly made changes and I hope it’s much better for attendees in 2019. My attendance is very doubtful, although I vacillate back and forth.

On the other hand, the MyHeritage LIVE conference was amazing with both livestreamed and recorded sessions which are now available free here along with many others at Legacy Family Tree Webinars.

Family Tree University held a Virtual DNA Conference in June and those sessions, along with others, are available for subscribers to view.

The Virtual Genealogical Association was formed for those who find it difficult or impossible to participate in local associations. They too are focused on education via webinars.

Genetic Genealogy Ireland continues to provide their yearly conference sessions both livestreamed and recorded for free. These aren’t just for people with Irish genealogy. Everyone can benefit and I enjoy them immensely.

Bottom line, you can sit at home and educate yourself now. Technology is wonderful!

2019 Conferences

In 2019, I’ll be speaking at the National Genealogical Society Family History Conference, Journey of Discovery, in St. Charles, providing the Special Thursday Session titled “DNA: King Arthur’s Mighty Genetic Lightsaber” about how to use DNA to break through brick walls. I’ll also see attendees at Saturday lunch when I’ll be providing a fun session titled “Twists and Turns in the Genetic Road.” This is going to be a great conference with a wonderful lineup of speakers. Hope to see you there.

There may be more speaking engagements at conferences on my 2019 schedule, so stay tuned!

The Leeds Method

In September, Dana Leeds publicized The Leeds Method, another way of grouping your matches that sorts them into clusters indicating your four grandparents.

I combine the Leeds method with DNAPainter. Great job Dana!

Genetic Affairs

In December, Genetic Affairs introduced an inexpensive subscription reporting and visual clustering methodology, but you can try it for free.

I love this grouping tool. I have already found connections I didn’t know existed previously. I suggest joining the Genetic Affairs User Group on Facebook.

DNAGedcom.com

I wrote an article in January about how to use the DNAGedcom.com client to download the trees of all of your matches and sort to find specific surnames or locations of their ancestors.

However, in December, DNAGedcom.com added another feature with their newly released DNAGedcom client, which downloads your match information from all vendors, compiles it, and then forms clusters. They have worked with Dana Leeds on this, so it’s a combination of the various methodologies discussed above. I have not worked with the new tool yet, as it has just been released, but Kitty Cooper has and writes about it here. If you are interested in this approach, I would suggest joining the Facebook DNAGedcom User Group.

Rootsfinder

I have not had a chance to work with Rootsfinder beyond the very basics, but Rootsfinder provides genetic network displays for people that you match, as well as triangulated views. Genetic network visualizations are great ways to discern patterns. The tool creates match or triangulation groups automatically for you.

Training videos are available at the website and you can join the Rootsfinder DNA Tools group at Facebook.

Chips and Imputation

Illumina, the chip maker that provides the DNA chips that most vendors use to test, changed from the OmniExpress to the GSA chip during the past year. Older chips have been available, but won’t be forever.

The newer GSA chip is only partially compatible with the OmniExpress chip, providing limited overlap between the older and the new results. This has forced the vendors to use imputation to equalize the playing field between the chips, so to speak.

This has also caused a significant hardship for GedMatch, which is now in the position of trying to match reasonably between many different chips that sometimes overlap minimally. GedMatch introduced Genesis as a sandbox beta version previously, but is now in the process of combining regular GedMatch and Genesis into one. Yes, there are problems and matching challenges. Patience is the key word as the various vendors and GedMatch adapt and improve their required migration to imputation.

DNA Central

In June, Blaine Bettinger announced DNACentral, an online monthly or yearly subscription site, as well as a monthly newsletter that covers news in the genetic genealogy industry.

Many educators in the industry have created seminars for DNACentral. I just finished recording “Getting the Most out of Y DNA” for Blaine.

Even though I work in this industry, I still subscribed – initially to show support for Blaine, thinking I might not get much out of the newsletter. I’m pleased to say that I was wrong. I enjoy the newsletter and will be watching sessions in the Course Library and the Monthly Webinars soon.

If you or someone you know is looking for “how to” videos for each vendor, DNACentral offers “Now What” courses for Ancestry, MyHeritage, 23andMe, Family Tree DNA and Living DNA in addition to topic specific sessions like the X chromosome, for example.

Social Media

2018 has seen a huge jump in social media usage, which is both bad and good. The good news is that many new people are engaged. The bad news is that people are often given faulty advice and for new people, it’s very difficult (nigh on impossible) to tell who is credible and who isn’t. I created a Help page for just this reason.

You can help with this issue by recommending that newbies or people seeking answers subscribe to these three blogs, not just read an article.

https://dna-explained.com/ (this blog)
https://thegeneticgenealogist.com/ (Blaine Bettinger’s)
https://www.legalgenealogist.com/ (Judy Russell’s)

Always feel free to post links to my articles on any social media platform. Share, retweet, whatever it takes to get the word out!

The general genetic genealogy social media group I would recommend if I were to select only one would be Genetic Genealogy Tips and Techniques. It’s quite large but well-managed and remains positive.

I’m a member of many additional groups, several of which are vendor or interest specific.

Genetic Snakeoil

Now the bad news. Everyone has noticed the popularity of DNA testing – including shady characters.

Be careful, very VERY careful who you purchase products from and where you upload your DNA data.

If something is free, and you’re not within a well-known community, then YOU ARE THE PRODUCT. If it sounds too good to be true, it probably is. If it sounds shady or questionable, it’s probably that and more, or less.

If reputable people and vendors tell you that no, they really can’t determine your Native American tribe, for example, no other vendor can either. Just yesterday, a cousin sent me a link to a “tribe” in Canada that will, “for $50, we find one of your aboriginal ancestors and the nation stamps it.” On their list of aboriginal people we find one of my ancestors who, based on mitochondrial DNA tests, is clearly NOT aboriginal. Snake oil comes in lots of flavors with snake oil salesmen looking to prey on other people’s desires.

When considering DNA testing or transfers, make sure you fully understand the terms and conditions, where your DNA is going, who is doing what with it, and your recourse. Yes, read every single word of those terms and conditions. For more about legalities, check out Judy Russell’s blog.

Recommended Vendors

All those DNA tests look yummy-good, but in terms of vendors, I heartily recommend staying within the known credible vendors, as follows (in alphabetical order).

For genetic genealogy for ethnicity AND matching:

23andMe
Ancestry
Family Tree DNA
GedMatch (not a vendor because they don’t test DNA, but a reputable third party)
MyHeritage

You can read about Which DNA Test is Best here although I need to update this article to reflect the 2018 additions by MyHeritage.

Understand that both 23andMe and Ancestry will sell your DNA if you consent, and if you do consent, you will not know who is using your DNA, where, or for what purposes. Family Tree DNA, GedMatch, MyHeritage, Genographic Project, Insitome, Promethease and LivingDNA do not sell your DNA.

The next group of vendors offers ethnicity without matching:

Genographic Project by National Geographic Society
Insitome
LivingDNA (currently working on matching, but not released yet)

Health (as a consumer, meaning you receive the results)

23andMe (limited health)
Promethease

Medical (as a contributor, meaning you are contributing your DNA for research)

23andMe
Ancestry
DNA.Land (not a testing vendor, doesn’t test DNA)

There are a few other niche vendors known for specific things within the genetic genealogy community, many of whom are mentioned in this article, but other than known vendors, buyer beware. If you don’t see them listed or discussed on my blog, there’s probably a reason.

What’s Coming in 2019

Just like we couldn’t have foreseen much of what happened in 2018, we don’t have access to a 2019 crystal ball, but it looks like 2019 is taking off like a rocket. We do know about a few things to look for:

MyHeritage is waiting to see if envelope and stamp DNA extractions are successful so that they can be added to their database.
www.totheletterDNA.com is attempting to extract and process DNA from stamps and envelopes for several people in the community. Hopefully they will be successful.
LivingDNA has been working on matching since before I met with their representative in October of 2017 in Dublin. They are now in Beta testing for a few individuals, but they have also just changed their DNA processing chip – so how that will affect things and how soon they will have matching ready to roll out the door is unknown.
Ancestry did a 2018 ethnicity update, integrating ethnicity more tightly with Genetic Communities, offered genetic traits and made some minor improvements this year, along with adding one questionable feature – showing your matches the location where you live as recorded in your profile. (23andMe subsequently added the same feature.) Ancestry recently said that they are promising exciting new tools for 2019, but somehow I doubt that the chromosome browser that’s been on my Christmas list for years will be forthcoming. Fingers crossed for something new and really useful. In the meantime, we can download our DNA results and upload to MyHeritage, Family Tree DNA and GedMatch for segment matching, as well as utilize Ancestry’s internal matching tools. DNA+tree matching, those green leaf shared ancestor hints, is still their strongest feature.
The Family Tree DNA Conference for Project Administrators will be held March 22-24 in Houston this year, and I’m hopeful that they will have new tools and announcements at that event. I’m looking forward to seeing many old friends in Houston in March.

Here’s what I know for sure about 2019 – it’s going to be an amazing year. We as a community and also as individual genealogists will be making incredible discoveries and moving the ball forward. I can hardly wait to see what quandaries I’ve solved a year from now.

What mysteries do you want to unravel?

I’d like to offer a big thank you to everyone who made 2018 wonderful and a big toast to finding lots of new ancestors and breaking down those brick walls in 2019.

Happy New Year!!!

______________________________________________________________

Disclosure

I receive a small contribution when you click on some (but not all) of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.

Thank you so much.

DNA Purchases and Free Transfers