May | 2017 | segment-ology

A Segment-ology TIDBIT

There are two concepts at work here – each is different – keep them straight and separate. Each in it’s own way can provide insights.

1. Amount of DNA from Ancestors. Although DNA from your Ancestors is random, there is some uniformity. You get exactly 1/2 of your atDNA from each parent; pretty close to 1/4 from each grandparent; about 1/8 from each great grandparent. By the time you get to your 128 5G grandparents, the average will be 1/128 from each one of them (roughly 1 percent), but with each generation past your parents, the deviation from the average increases. Still – the total of all ancestors at any generation will sum to 100 percent. On average you got about 15cM from each 5G grandparent, which may be in one segment or spread over several segments. The companies report that, in general, you will not share enough DNA to match most of your 6th cousins; but we know that you have so many 6th cousins that you will share with many of them – experience shows you will share from 0-21cM with some your 6th cousins. [see The Shared cM Project 2.0 at https://isogg.org/wiki/Autosomal_DNA_statistics ]

2.Number of Matches from Ancestors. This depends on several factors: Number of cousins, who tests, endogamy, etc.

A. The number of cousins you have from each Ancestor is directly related to the size of their families leading to the number of their living descendants. Larger families => more cousins => more Matches.

B. Similarly the number of cousins you have from each Ancestor is directly related to how far back that Ancestor is. Distant Ancestors => more cousins => more Matches.

C. Who tests. This may be most pronounced with recent immigration. The more recent an Ancestor immigrates, the fewer number of living descendants in the US, where the highest percentage of atDNA test takers live. Recent immigration => fewer cousins tested => fewer Matches.

D. Endogamy. With endogamy, the amount of shared DNA increases, resulting in many more Matches who otherwise might not have matched. The classic case is Ashkenazi Jews, who tend to get many times the number of Matches others get, and the Matches are in fact somewhat more distant than indicated. This also applies to other endogamous populations. Endogamy => more cousins => more Matches.

Some of my experience and observations

My maternal grandmother was from immigrants to US in 1850s. Roughly 1/4 of my atDNA is from her, and 1/4 of my Chromosome Map has TGs from her line. But, I get relatively few Matches on this 1/4 of my Ancestry, and some of the “gaps” in my Map have no Matches in them at all. When I get only 1 or 2 Matches in a TG on my maternal side, it’s pretty certain they are from my grandmother’s line. The other maternal TGs are from my mother’s paternal side with deep Colonial Virginia roots – they tend to have many Matches.

As the Matches pour in, and are added to my TGs and Chromosome Map, I can see where some TGs have many Matches and must come from distant Ancestors – perhaps at the limits of my Tree, or beyond – so I focus on the closer MRCAs (Most Recent Common Ancestors) which provide pointers. In some cases I’ve found the Ancestor had a Plantation and raised many children.

Some of the other TGs have small to medium numbers of Matches, and where I can find an MRCA, they tend to be smaller Colonial families – folks who moved around a lot; died earlier in life; and/or had few known children who survived.

So as you form TGs, and look at the resulting Chromosome Map, they start to paint a picture…

[22K] Segment-ology: Amount of DNA vs Number of Matches TIDBIT; by Jim Bartlett 20170517

A Segment-ology TIDBIT

Here is an example of the format I usually use for AncestryDNA Notes, line by line:

[1] 6C1R: BUTCHER/BUSH > Valentine

[2] Gm 123456 [03F25] 15.1cM

[3] 12.4cM/1Seg

[4] SM: 6 [1 is 4C on BUTCHER]

[5] m5/4/17; mr5/5/17w/Gm#

[6] Added to xls & Notes

[1] is just line number, you don’t need to type that in

6C1R is 6th cousin once removed; 4Cx2 is double 4C

BUTCHER/BUSH is our MRCA surnames > Valentine is the child my Match descends from

Gm is GEDmatch ID

[03F25] is my Triangulated Group ID [Chr 3; F means start in 50-60Mbp range; 25 are first two Ahnentafel numbers of my ancestry to the MRCA – in this case my father and his mother]

15.1cM is cM per GEDmatch

12.4cM is cM per AncestryDNA [almost always smaller]

SM = Shared Matches; then the number of them; then any key info about any of them [in brackets]

M5/4/17 means I posted a message; mr means reply

Added note confirms info was added to my spreadsheet and my Notes on each surname

This info is available by hovering over the little document icon next to each Matches name. It is very useful in its own right to help me keep track of each Match. It is also particularly valuable when reviewing a list of Shared Matches – you can easily see which ones you have notes for and what the notes say. This helps you decide if a group of Shared Matches are grouping around one ancestral line.

I do this pretty religiously for all Matches with Hints; and try to do it for all 4C Matches, too – no time is left for the tens of thousands of other AncestryDNA Matches.

[22I] Segment-ology: Format for AncestryDNA Notes TIDBITS; by Jim Bartlett