segment-ology

Pro Tools Part 19

Featured

Posted on December 11, 2024 by Jim Bartlett

Comments on Sacrilegious Genetic Genealogy

I thought these comments were excellent and wanted to share them.

Guest Post from Terry Butcher dated 11 Dec 2024

In regard to your Pro Tools Part 16 Sacrilegious Genetic Genealogy post, I would like to share some thoughts on the topic.

While I appreciate the power that various DNA analysis techniques offer in identifying clusters of matches to specific common ancestors, my primary focus has always been about the genealogy side of the effort.

I feel that I need to connect my tree to each match to really have anything of value. I already accept that I am related to my matches (within the parameters you have described related to cM size). Being able to document the relationship and share it with my matches is my reward for investing the time and effort in researching them.

I try to make a connection with each match and approach each one as an opportunity to learn something new. Each match that I find a common ancestor for in essence validates that specific branch of my tree by having both a paper trail and a DNA match.

I add my match’s tree into my tree as I research them. I start by adding them as an unrelated person in my tree and start working back along their tree picking up all of their branches until I either find a common ancestor, hit a dead end or believe there is no longer any possibility because the location has gone back to Europe. It usually doesn’t take long to find most CAs. While researching a match, I usually only add parents and the child, ignoring the other siblings to save effort. However, if I am successful in finding our CA, I will usually go back and pick up the other siblings for several of the most recent generations.

I have been systematically working my way through my matches starting with closest related and have made it down to the 41 cM matches (about 2,000 so far). If the match has useful information in their tree, I have been successful about 90-95% of the time. In the past, I would contact matches without trees and offer assistance. Now with Shared Matches Pro I am able to find their close matches with trees and sometimes find a CA. This is a much-welcomed capability that changes what is possible in my research. I have a total of 132k matches now with 11,500 marked as 4th cousin or closer. It would take me many, many years to even get through the 4th cousins and closer matches so I am not worried about running out of matches to research for which I have an excellent chance of finding a CA.

For the 5-10% of my matches whose trees I build but for whom I cannot find a CA, I suspect they may be either connected with 2 brick walls that I have at 3rd GGF or some unknown adoption or incorrect parent in my tree. Several of these unsolved CA matches now tie together in their trees and I am hopeful they will eventually result in solutions.

By working through my matches and incorporating their trees into my tree, I have expanded my tree significantly to over 222k people now. As nearly all of my ancestors have lived in WV since the early 1800s, my tree is heavily weighted with WV families. I typically don’t have to add but a generation or two until I find my CA.

I am not concerned about having floating tree branches as I believe they will eventually connect into my overall tree. Anytime I encounter a common surname in my research, I chase it back until it connects with other members of that family which strengthens the connections in my tree.

I value the ability to generate family tree reports showing the relationship path between my match and myself and always share the typically one-page report with my match by saving it to my Dropbox folder and sharing a link in the message I send them.

Any match that I can connect to my tree to a CA has over 10k ancestors (and their descendants) with many up to 40k.

My approach over my 30 years of genealogy as a hobby has evolved as it has for most I suspect. As I research, I pick up as much information as I can including photos, obituaries and sometimes other records like draft registration documents, marriage and death certificates. All of these documents are incorporated into the detailed reports I generate whenever the person is included in the report which makes for some very interesting reading for my matches when I share reports with them. I find that Ancestry provides 98% of my information with a bit of help from the other sites whenever I hit a dead end in Ancestry.

[22DA] Segment-ology: Pro Tools Part 19 – Comments on Sacrilegious Genetic Genealogy by Terry Butcher 20241211

Pro Tools Part 18


Posted on December 10, 2024 by Jim Bartlett

Family Group Sheets

One of the key features of my Common Ancestor Spreadsheet (see post here) is that it offers an arrangement like a traditional genealogy Family Group Sheet (FGS). The FGS has an Ancestor couple at the top of the sheet, with a list of their children down the page with birth, death, marriage dates and places. If we are going to create an inventory of our DNA Matches with known links to an MRCA, this FGS spreadsheet format would be a great way to do that. It also turns out to be a handy tool when working with Pro Tools.

The Common Ancestor spreadsheet for Match cousins is actually a “nested” FGS. By sorting on Ancestor Ahnentafel Numbers, all the Matches connected to one Ancestor are grouped together. By also sorting on the birth year of the Ancestor’s children, this “FGS sort” results in Matches grouped under each child. By adding sorts on birth years for grandchildren and great grandchildren, we get a “nested” FGS. I regularly use my entire spreadsheet sorted by these four columns.
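
The four-column “FGS sort” can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the column names (ahnentafel, child_born, grandchild_born, ggchild_born) and the sample rows are hypothetical, not the actual spreadsheet layout.

```python
# Hypothetical rows; the column names are illustrative, not the real spreadsheet headers.
rows = [
    {"match": "Match C", "ahnentafel": 38, "child_born": 1815, "grandchild_born": 1840, "ggchild_born": None},
    {"match": "Match A", "ahnentafel": 36, "child_born": 1802, "grandchild_born": 1830, "ggchild_born": 1861},
    {"match": "Match D", "ahnentafel": 38, "child_born": 1808, "grandchild_born": 1835, "ggchild_born": 1860},
    {"match": "Match B", "ahnentafel": 36, "child_born": 1802, "grandchild_born": 1826, "ggchild_born": 1850},
]


def fgs_key(row):
    """Sort key: Ancestor first, then birth years down three generations.

    Unknown (None) years sort after known years within the same group.
    """
    def year(value):
        return (value is None, value or 0)

    return (
        row["ahnentafel"],
        year(row["child_born"]),
        year(row["grandchild_born"]),
        year(row["ggchild_born"]),
    )


nested_fgs = sorted(rows, key=fgs_key)
for row in nested_fgs:
    print(row["ahnentafel"], row["child_born"], row["grandchild_born"], row["match"])
```

In a spreadsheet program the same effect comes from a multi-level sort on those four columns; the point of the sketch is only that the sort order, not any special layout, produces the nesting.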

This arrangement has several advantages when using Pro Tools…

1. When Pro Tools indicates a parent/child or sibling relationship to an existing Match (already entered into the spreadsheet), I can create a new row and copy most of the info and just adjust one column – a real time saver. And this works even with new Matches with No Tree, Private Tree, Unlinked Tree, Scrawny Tree, even small cMs – Pro Tools has already provided all the relationship information needed.

2. When Pro Tools indicates a (full) 1C relationship to an existing Match, this limits the relationship possibilities to only two. [In my experience, 1C estimates are highly accurate.] Analysis: the new Match is connected to the existing Match (already in the spreadsheet) on (1) the same side I am on, or (2) the other side. Be aware of this! If the new Match is on the “other” side, they are NOT part of this Ancestor (Ahnentafel) line. If the new Match has any info in a Tree, this “side” issue can usually be figured out and the spreadsheet cells filled out (mostly by copying from the existing Match). If there is no Tree info, the “side” can usually be determined by looking at the Shared Matches of the new Match (sorted on new Match’s cMs). There should be a clear consensus (at/near the top of that list) of the same Ancestor line as the existing Match. If not, then skip this new Match. If so, I add a row for the new Match, copy data from the existing Match, and enter GUESS for the new Match parent (as a sibling of the existing Match parent), and then the new Match. [NB: to save typing, I indicate each “terminal” Match as an asterisk (*) because they are already spelled out in the Match column near the beginning of the row.]

Analysis summary: A) look at their Tree; and/or B) look at their closest SMOMs.

3. For a 1C1R or 1C2R the estimates are still very good, and the process above can be used. Use available info or judgement to shift the new Match to the right or left per the “removes”. Where the individuals are not known, just put Unknown or Private in the cell. The complete path down to the Match is not critical, IMO.

4. When Pro Tools indicates Aunt/Uncle or Niece/Nephew, that too is highly accurate, as are the genders. Similar to the above, there is usually enough information to place them in the spreadsheet (which is like a horizontal Tree).

5. Pro Tools often includes a Half relationship in their estimate. This is based on tables that indicate two estimates shown are almost exactly the same cM range. Although technically correct, it is much more likely, IMO, that the relationships are standard (NOT Half). But a few will be Half so watch for that situation. Remember these Pro Tools cMs are between your Match and the Shared Match (not affected by whether or not you have a Half relationship with the Ancestor).

6. Adding a hitherto unknown child branch – best described by a recent example I had. In looking for my A38 (ALLEN ancestor) cousins, I found a bunch descending from four well documented children of A38 – 56 Match cousins (4C, 4C1R, 4C2R and 4C3R) with an average of 18cM. There appear to be more than four children in the 1810-30 Virginia census records. And there was an old story about this family, that a son named William went west. So when some known Matches had some SMOMs with ancestor William H ALLEN born 1815 in VA and living in IL, I took notice – it seemed to fit. As I pushed it with Pro Tools I found (so far) 10 Matches descending from William H ALLEN averaging 20cM. But more importantly, those Matches also had Shared Matches with 12 of the 56 Matches from other children from this A38. It sure looked like a Cluster with gray cells to other Clusters! I’d really like to determine William’s Y-DNA; and/or some DNA segment data… But, in the meantime, I’ve got two of William’s descendants checking their Matches for links to my A38 ALLEN. There are 147 Trees at Ancestry for William H ALLEN – not a one has any good clue to his ancestry, except that he was born in VA. Not my Brick Wall, but I think there will be 147 happy campers.

A key point in this long story is that the DNA has no sense of geography. The facts that four children stayed in VA (and were well known) and one child moved far away made no difference to the DNA. From each descendant’s viewpoint, all the lines were equal – and a pretty even distribution of Matches showed up for all 5 children. The DNA is like blind justice.

7. Equality – a final thought is that this spreadsheet is a lot like the DNA – it’s relatively equal over all the Ancestors and descendants. This spreadsheet encourages me to treat all of my Ancestors equally (they each have an Ahnentafel placeholder row). I still have my “favorite” Ancestors, but as I methodically go through the spreadsheet, I’m spending time on each one. This includes the Ancestors that have issues… This spreadsheet also highlights the Brick Wall holes, to be plugged with floating family branches. This is a good thing.
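
The “side” check in item 2 above (look at the closest Shared Matches of the new Match for a consensus Ancestor line) can be sketched as follows. The function name, the top_n and min_share thresholds, and the sample data are all hypothetical; in practice the consensus is a judgment call rather than a fixed cutoff.

```python
from collections import Counter


def consensus_line(shared_matches, top_n=10, min_share=0.6):
    """Return the Ancestor line most of the closest known Shared Matches agree on,
    or None when there is no clear consensus.

    shared_matches: list of (cm_with_new_match, ancestor_line or None) tuples.
    top_n and min_share are illustrative thresholds, not established rules.
    """
    closest = sorted(shared_matches, key=lambda m: m[0], reverse=True)[:top_n]
    known = [line for _, line in closest if line is not None]
    if not known:
        return None
    line, count = Counter(known).most_common(1)[0]
    return line if count / len(known) >= min_share else None


# Hypothetical Shared Matches of a new Match: (cM shared with the new Match, known line)
smoms = [(880, "A38"), (410, "A38"), (225, None), (190, "A38"), (95, "A20"), (60, "A38")]
print(consensus_line(smoms))
```

If the returned line is the same as the existing Match’s line, the new Match is probably on “my” side; if it is a different line, or None, the new Match is skipped or set aside for more research.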

To me, the key points in doing this spreadsheet work also include:

1. An inventory of Matches who have MRCAs with me. Separate from my on-line Tree. Saved in the cloud and/or archived – available to my heirs or selected genealogy archives someday.

2. Family Group Sheets – of sorts* – this is a standard genealogy tool.

3. A Quality Control check on the accuracy of name spelling and birth years; and the FGS itself. This QC review often reveals “quirks” (as a kinder word) that folks have in their Trees…

4. With Ancestor second marriages, this FGS listing will show the demarcation between full cousins and Half cousins. [I add “INSIGHT” rows with marriage years that will sort and separate the children to the different parent couples.] Half cousins for me only occur at the children level in my spreadsheet. Half cousins between Matches and Shared Matches can occur anywhere.

5. A re-sort by Match name highlights multiple relationships. Since shared DNA is divided by 4 (on average) going back each generation, the closer relationships are much more likely. I’ve found some Matches with MRCAs on both sides of my Tree. With single shared segments, the DNA can only come from one Ancestor. With multiple shared segments, there may be a segment for each line.
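
The “divided by 4” rule of thumb in item 5 can be written out as a small calculation. These are theoretical averages for full cousins only; real shares vary widely, and many distant cousins share no detectable DNA at all. The function name is just for illustration.

```python
def expected_shared_fraction(cousin_degree, removed=0):
    """Theoretical average fraction of autosomal DNA shared by full cousins.

    1C -> 1/8, 2C -> 1/32, 3C -> 1/128: each extra degree divides by 4,
    and each "remove" halves the value again.
    """
    return 1 / (2 ** (2 * cousin_degree + 1 + removed))


for degree in range(1, 6):
    print(f"{degree}C: {expected_shared_fraction(degree):.4%}")
```

So when a Match has two possible paths, say a 3C line and a 5C line, the closer line is expected to contribute about 16 times as much DNA on average, which is why the closer relationship is the better first guess for a single segment.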

* I used “of sorts” in 2 above, because this FGS will not usually be a complete list of all Ancestor children, grandchildren, etc. It includes only the ones who provided a DNA path down to our Matches. Which in turn depends on family sizes and who did DNA tests – there can be wide variations on both.

Note: If I were starting over, I’d probably add name & birth year columns for 9 generations – out to 8C level; and then a catch-all column for any additional info. This would provide a handy way to evaluate the cousinship levels. Reminder: I only list the given name and one initial for males; and the given name, initial and married surname for females. I try to keep it as easy and simple as possible.

Bottom line: An FGS spreadsheet offers an easy way to add new Matches which have been identified by Pro Tools as closely related to known Matches. This adds independent, genealogy triangulation and tight Clusters to an inventory of known Matches. It will be an outstanding adjunct to an auto-Clustering program.

Also – you don’t have to use a spreadsheet to benefit from most of the concepts embedded above.

[22CZ] Segment-ology: Pro Tools Part 18 – Family Group Sheets by Jim Bartlett 20241209

Pro Tools Part 17


Posted on December 8, 2024 by Jim Bartlett

NPEs

If we just consider our own ancestral line, we may miss some NPEs. We may have an NPE as an Ancestor, IF we haven’t explored the whole family.

Way back, NPE was Non-Paternal Event, but we’ve seen non-Maternal events, too. So we changed it to Not the Parent Expected. The whole issue centers around the expectation of a family with two “expected” parents. Important: an NPE is usually for one child – perhaps your Ancestor; perhaps a different child in the family. We “expect” all the children in a family to be from the husband and wife. So “usually” an NPE is a one-off event. But life unfolds in many different ways…

A man and a woman create a child – sometimes one of them is not married (i.e. living with their parents, or on their own) – or perhaps this is the case for both of them. Sometimes they are both married to someone else. Sometimes the man is not (or is never) aware the woman got pregnant. Again – in life, there are many variations to this. The point is the NPE does not apply to a family – it applies to a child. This is important to DNA analysis, and how we use Pro Tools.

I have this case for one of my Ancestors. The pregnant woman was an unmarried child in a family who raised her and her son, giving him their surname (which has confused genealogists to this day). It appears the father was not yet married either, but he went on to marry and have children. I know because I got some DNA from him (through the NPE child) and have Matches who descend from him through his other children (half cousins), and through her children by her later marriage (half cousins). [NB: Challenging in my Common Ancestor spreadsheet.]

Getting back to Pro Tools – the DNA truth-teller/helper. In general, the higher-cM SMOM interrelationships lead to one generational level in my Tree – to one MRCA couple. They may be cousins 1 or 2 or 3 times removed (because I’m old), but usually all go back to one MRCA. Then, as I scroll down the SMOM list, I often find SMOMs who descend from one generation further back. This is normal and expected. These would be a generation more distant to us, and should have appropriately smaller cMs, on average. In fact, if this doesn’t happen, we should be suspicious.

NB: Alternatively, some highest-cM Matches may be tied to a closer generation (which should be, on average, a higher-cM relationship). If these higher-cM Matches are at the same generation level, it may be due to multiple segments and, perhaps, additional relationships (with Colonial Virginia ancestry, I sometimes find multiple relationships with some Matches).

Finally, back to NPEs… If one of the Ancestors in an MRCA couple is an NPE, you wouldn’t get any Matches to that couple (just like with an only child; an exception would be if they had more than one child together). So, instead, look to see that *some* of the Matches are from each bio-parent. This is how I solved a Brick Wall. I had many Matches to my A36 (4C level) Ancestors [Thomas NEWLON & unknown wife]. As I kept looking at the Shared Matches, I found some smaller-cM Matches to my A72 (5C level) couple [Thomas NEWLON’s parents] who had been well researched. Analysis of “other” Shared Matches revealed many had the CUMMINGS surname (now my A74; 5C level ancestor).
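
The Ahnentafel numbers in this example follow the standard numbering, where the father of person n is 2n and the mother is 2n + 1. A minimal sketch (the helper names are mine, and I am assuming the unknown wife is A37 as the standard numbering implies) shows why the parents of A36 are the A72 couple, why the wife’s father is A74, and how the “cousin level” follows from the number:

```python
def parents(ahnentafel):
    """Standard Ahnentafel numbering: father is 2n, mother is 2n + 1."""
    return 2 * ahnentafel, 2 * ahnentafel + 1


def cousin_level(ahnentafel):
    """Cousin level of Matches descending from this Ancestor's couple.

    Grandparents (4-7) give 1C, great grandparents (8-15) give 2C, and so on.
    """
    if ahnentafel < 4:
        raise ValueError("cousins start at the grandparent level (Ahnentafel 4+)")
    return ahnentafel.bit_length() - 2


husband = 36
wife = husband + 1  # assumption: the wife of an even-numbered Ancestor is n + 1

print(parents(husband))  # (72, 73)
print(parents(wife))     # (74, 75)
print(cousin_level(36), cousin_level(72), cousin_level(74))  # 4 5 5
```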

The point is that if Pro Tools points to a group of higher-cM Matches to a 3C, 4C, etc MRCA; the lower-cM Matches should point to groups for the next two MRCAs back. This is true whether these MRCAs are well known or an NPE or a Brick Wall. If you find a consensus Ancestor among these smaller-cM Matches you may have found GOLD.

Bottom Line: When dealing with an NPE, think carefully about what that means to Pro Tools, and target your “rabbit holes” appropriately;>j

[22CY] Segment-ology: Pro Tools Part 17 – NPEs by Jim Bartlett 20241208

Pro Tools Part 16


Posted on December 5, 2024 by Jim Bartlett

Sacrilegious Genetic Genealogy

For this post I want to explore a deviation from the normal genealogy and DNA research “requirements”.

Do we need to do comprehensive research on each cousin Match? Do I really need to find the complete link between each Match and our Common Ancestor? The sacrilege: do I care about all my distant cousins – to the extent that I must develop their complete link to me? Do I really care how much DNA they share with me? Must I link the DNA to the Common Ancestor? Or, is it enough to determine that they are on a specific branch of my Tree? I think so!

My standard mantra: our bio-Ancestors and DNA segments are set! We compare each Match to our Tree and DNA to find a Common Ancestor. I’m very close to finding out how 10% of my 100,000 Matches (at Ancestry) are related to my bio-Ancestors.

My experience with Pro Tools indicates many more can be easily found. I acknowledge that some shared DNA segments under 15cM will be false – but that doesn’t mean those Matches aren’t related to me. Most of our true cousins beyond 3C will not share any DNA with us, so is the cM amount beyond 3C meaningful? I acknowledge that some Matches will be related beyond a genealogy timeframe.

However, given these negative factors, I believe a lot more of my Matches are related to me within 9 generations back [8C level] – perhaps somewhat more than 20% of my total Matches. It’s taken me 14 years to “collect” and document approximately 10% of my Matches as cousins. It’s daunting to think what time and effort I’d need to double that.

My sacrilege is to give up on full genealogy research for each Match. Using Pro Tools I’m finding lots of 6-10cM (small segment) Matches (to me) that are children, nieces/nephews, or 1C to strong higher-cM Matches that I have placed in my Tree. Clearly, these Matches are part of a family group well within a genealogy time frame.

I’m inclined to just quickly:

1. Add these small-segment Matches to my Common Ancestor spreadsheet

2. Add a Match Note (at Ancestry) to indicate the Common Ancestor and/or Ahnentafel [e.g. #A0062]

3. Give them my standard star and MRCA Dot; but not the Dot indicating a linked Match

4. Use a new Dot to indicate “Likely” in a family group under the MRCA; but not complete research [I could always filter on that Dot later, and do the research, some day…]

5. Add a shorthand note like: SMOM: 3,442cM/son of “Match Name” [SMOM: Shared Matches of Match – the cM between them]

I’m looking for a more efficient way to group Matches into known family lines.

There are several points here:

1. Identify additional Matches within a genealogy timeframe (is it over 50% of all Matches?)

2. Group Matches under my Ancestor Couples – often under a specific child or grandchild (why would I need to dig deeper – unless the Match had a robust Tree with many records…)

3. Build a firm interrelated framework for later research on each extended “twig” of my Tree. Gain some confidence in my Ancestors and their children and grandchildren.

4. Identify Brick Walls through clear absence of interconnected Matches. My spreadsheet has an Ahnentafel header for each of my Ancestors back to the 8C level – some of them have no known Matches, or what is clearly a small mess of non-interconnecting Matches. These are a judgment call, but with many more Matches involved, these few “problems” become more and more obvious.

5. Connect Floating branches – I now have several strong “clumps” of interconnected Matches, under a single MRCA couple, that I cannot link to my Tree. This is a strong hint in light of #4 above. I plan to explore this more in a separate blogpost.

For DNAGedCom, Genetic Affairs, DNA Painter: Any way to automate the Clusters/Groups to include only those Matches who interrelate, say, over 90cM (and make that threshold adjustable)?
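
To make the request concrete, here is a rough sketch of what such a filter might do. It is not how any of the named tools work; the function, the data structure and the names are hypothetical. It keeps only links between Matches at or above an adjustable cM threshold and then collects the Matches that hang together.

```python
from collections import defaultdict


def groups_over_threshold(pair_cms, threshold=90):
    """Group Matches linked by at least one shared-cM value >= threshold.

    pair_cms: dict mapping (match_a, match_b) -> cM shared between those two Matches.
    Returns a list of sets (connected groups); Matches with no strong link are omitted.
    """
    links = defaultdict(set)
    for (a, b), cm in pair_cms.items():
        if cm >= threshold:
            links[a].add(b)
            links[b].add(a)

    groups, seen = [], set()
    for start in links:
        if start in seen:
            continue
        group, stack = set(), [start]
        while stack:
            name = stack.pop()
            if name in group:
                continue
            group.add(name)
            stack.extend(links[name] - group)
        seen |= group
        groups.append(group)
    return groups


# Hypothetical Match-to-Shared-Match cMs
pairs = {("Ann", "Bob"): 3442, ("Bob", "Cal"): 210, ("Ann", "Dee"): 35,
         ("Eve", "Fay"): 880, ("Cal", "Eve"): 22}
print(groups_over_threshold(pairs))
```

With the default threshold, Ann, Bob and Cal form one group and Eve and Fay another; the weak 22cM and 35cM links are ignored, so Dee drops out. Lowering the threshold merges groups, which is why an adjustable value would be useful.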

Bottom line: I think many more, if not most, of our Matches will turn out to be real cousins within a genealogy timeframe (out through 8C level). This includes Matches with no Trees, Private Trees, UnLinked Trees and scrawny Trees – all of these are now put into the mix through Pro Tools. For me, compiling data from my 100,000 Ancestry Matches will be a way to bound (if not counter) the continued warnings that many of our Matches are false and/or distant. Some are, some are not – what can we learn?

As usual, I value your feedback – on the sacrilege of adding Matches to Tree branches based on strong interrelationships, but without fully documenting the genealogy; as well as the bigger picture of possibly linking Floating branches to “bare spots” in our Trees.

[22CX] Segment-ology: Pro Tools Part 16 – Sacrilegious Genetic Genealogy by Jim Bartlett 20241205

Pro Tools Part 15


Posted on November 25, 2024 by Jim Bartlett

Shared Match Cluster Hints

I’ve written in this Pro Tools series about the power of Shared Matches. They form manual Clusters of Matches. Like all Clusters, they *tend* to point to a Common Ancestor. Each individual Match has their own ancestry, and they may relate to us in several different ways (particularly with my Colonial Virginia ancestry). With auto-Clustering this is displayed by placing the Match in a Cluster with the strongest ties to other Shared Matches – and using gray-cells to indicate ties to other Clusters. This shows up in a Shared Match list with a mix of Shared Matches tied to one Common Ancestor, along with other Shared Matches who may be related in different ways, and even some Shared Matches who might not be interrelated at all.

So, to make a point: Shared Match Clusters (or concentrations in Shared Match Lists) should be considered as a Hint. The stronger the consensus, the stronger the Hint. The chore that still remains is tracing the genealogy from the Match to a Common Ancestor(s).

I find that consensus is a judgment call. But when I make that call, I usually find other Matches with a genealogy link as expected. But not always…

Segment Triangulation is fairly precise – each of our DNA segments came to us from one particular ancestral path. Shared Matches (aka In Common With, aka Relatives in Common, etc) are not equivalent to Triangulation. When Shared Matches form a Cluster, it’s a strong Hint. And a 20×20 Cluster is much stronger than a 3×3 Cluster. And a 20×20 Cluster where each Match matches almost all of the other Matches is very strong, compared to a 20×20 Cluster where each Match only matches, say, half of the others… I have found large, strong Clusters (beyond close cousins) usually turn out to include one TG (maybe two), but there is no hard rule.
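
One way to put a number on “each Match matches almost all of the other Matches” is the share of possible pairs in a Cluster that actually show up as Shared Matches. This is my own illustrative measure, with made-up names and data, not something Pro Tools or the clustering tools report in this form.

```python
from itertools import combinations


def cluster_density(members, shared_pairs):
    """Fraction of possible Match pairs in a Cluster that are actually Shared Matches.

    members: list of Match names in the Cluster.
    shared_pairs: set of frozensets, one per pair that appear on each other's lists.
    """
    possible = list(combinations(members, 2))
    if not possible:
        return 0.0
    linked = sum(1 for a, b in possible if frozenset((a, b)) in shared_pairs)
    return linked / len(possible)


members = ["M1", "M2", "M3", "M4"]
tight = {frozenset(p) for p in combinations(members, 2)}
loose = {frozenset(("M1", "M2")), frozenset(("M2", "M3")), frozenset(("M3", "M4"))}
print(cluster_density(members, tight))  # 1.0
print(cluster_density(members, loose))  # 0.5
```

A large Cluster with a density near 1.0 is the “very strong” Hint described above; the same size Cluster at 0.5 deserves more caution.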

Summary: Shared Matches can be grouped into Clusters. Clusters are not the same as Triangulated Groups (TGs), but they can be good pointers and helpful Hints.

[22CW] Segment-ology: Pro Tools Part 15: Shared Match Cluster Hints by Jim Bartlett 20241125

Pro Tools Part 14


Posted on November 24, 2024 by Jim Bartlett

Jigsaw Puzzles

Our genetic genealogy is very much like a jigsaw puzzle. Our Ancestors and our DNA segments are both pieces of a large jigsaw picture (ourselves). Soon after the moment of conception – when sperm meets egg – our DNA segments and crossover points are determined. And, of course, our Ancestors, each with 2 biological parents, are determined. There may be lots we don’t know, but those configurations (DNA and Ancestors) are fixed – waiting for us to discover them. Just like a box of jigsaw puzzle pieces, all the pieces are there – and they only go together one way (like our DNA segments and our Ancestors).

Now think about our DNA Matches – perhaps 100,000 of them – as we open our list… The overarching concept is that a Match sharing at least 15cM with us is always a true (Identical By Descent or IBD) relative; and over half of the remaining Matches will also be IBD and a true relative. Of course, some of these Match-relatives will be distant cousins.

Based on my deep dive with Pro Tools, I’m now convinced at least 20% of my DNA Matches at Ancestry are relatives within a genealogy time-frame. I’ll go out on a limb and say 8C or closer!

So, to the point of this blog post… 20,000 of my 100,000 Matches are probably 8C or closer. Each one of them is a jigsaw puzzle piece. Each one interlocks with me (sometimes in multiple ways) and very often with other Matches (look at *their* Shared Match list). In many cases they form interlocking relationships with each other, from siblings to parent/child to 1C and 2C and 3C interrelationships. Just like a jigsaw puzzle. Some will be like the jigsaw lake, or forest, or barn or road – all of which “clumps” of the puzzle will eventually integrate – only one way – into the grand picture….

With Pro Tools’ new Sort feature (the Shared Matches’ *close to distant* Sort), it’s a whole lot easier to form small branches. Think of it this way…. You have 1,000 Matches, and you can easily find links that result in 500 pairs…. In a flash, you’ve cut your workload in half. And as you form larger clumps of Matches – all of your Matches in that clump must lead back to you! Put another way, look at the clump and see where all of your Matches have a Common Ancestral line – out of the clump and directly into your Tree – somehow…

The jigsaw puzzles:

The Ancestors must interlock in pairs and form an entire “Tree” jigsaw picture.
The DNA segments must array adjacently and form a Chromosome Map picture.
Our Matches will interlock with us; each other; and our Ancestor Tree.

[22CV] Segment-ology: Pro Tools Part 14: Jigsaw Puzzles by Jim Bartlett 20241124

Pro Tools Part 13


Posted on November 17, 2024 by Jim Bartlett

Status of Common Ancestor Spreadsheet

I have a spreadsheet of all Matches with Common Ancestors with me. It includes my Ancestors and their children down to each Match. See more at https://segmentology.org/2021/12/19/segmentology-common-ancestor-spreadsheet/ It’s a lot of work, so feel free to adapt it to suit your needs.

I have been reviewing all of these Matches and adding a LOT more using Pro Tools. I posted various ways to do this here, and I’ve gone down all those rabbit holes. I’m now on a march to review these Matches methodically – from closest Ancestors to more distant. I’ve found that it’s essential to have “known” Matches highlighted in Shared Match lists to speed the process of determining new Matches with CAs and forming family groups. So I’m adopting a two phase process. First: Recheck all Matches for firm relationships and having a clear set of Dots that will spotlight them in a Shared Match List – probably out to 5C level; Then: I’ll go back and use Pro Tools to tease out new Matches to add in.

Toward this end, I’m going to paste a Table below that shows my progress to date; and later I’ll update the Table to show the effect of Pro Tools. I’ve used Ahnentafel numbers (male of an ancestral couple) – their names are not needed for this exercise, although I did use given names for children for the first two generations. The comment column gives some reasons why the cMs deviate from the averages as when there are double Cousins or half Cousins, or Ancestors out of the US. You may also note the high number of Matches for Ahnentafel 70 – it’s because I jumped to that Ancestor, and used Pro Tools to find several key Matches to help with a burning question.

Here is where I stand now:

Note that this summary has 2477 Matches, through the 5C level (4XG grandparents). I have another 6,070 Matches in the 6C to 8C group. My total is 8,547 Matches from AncestryDNA, out of about 100,000 total – I wanted to see what impact Pro Tools will have. We’ll see how far I can get…

[22CU] Segment-ology: Pro Tools Part 13 – Status of Common Ancestor Spreadsheet by Jim Bartlett 20241117

Pro Tools Part 12


Posted on October 28, 2024 by Jim Bartlett

The joke’s on me… heads up!

In my last post I noted that the Pro Tools cM relatedness was pretty accurate! Today I found two Matches who were 1C – their parents were brothers. But the SMOM said 1,637cM, so they had to be half siblings. I checked with DNA Painter – 1,637cM is 100% half siblings (for same generation relationship). Back to the drawing board… Did the two brothers marry (or have children with) the same wife? Maybe one brother died, and the other married the widow… Nope. Checking some more – the two brothers married two sisters! They were double 1C! Not in the DNA Painter range of options, but spot on for twice the 1C cMs. All is OK, but it had me scratching my head for a few minutes.
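
The arithmetic behind the head-scratching is simple enough to write down. Using theoretical average shares (real values vary), a double 1C relationship is two separate 1C relationships added together, which lands exactly where half siblings do:

```python
FIRST_COUSIN = 1 / 8   # theoretical average share for full 1C
HALF_SIBLING = 1 / 4   # theoretical average share for half siblings

# Double 1C: the two Matches are 1C through both their fathers and their mothers.
double_first_cousin = 2 * FIRST_COUSIN
print(double_first_cousin == HALF_SIBLING)  # True

observed_cm = 1637
print(observed_cm / 2)  # 818.5 cM attributable to each 1C relationship, on average
```

So a cM total in the half-sibling range between two known 1C is a prompt to check whether both sets of parents are related, before assuming a surprise in the family.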

[22CT] Segment-ology: Pro Tools Part 12 – The Jokes on Me by Jim Bartlett 20241028

Pro Tools Part 11


Posted on October 28, 2024 by Jim Bartlett

Ways to analyze Shared Matches Of Matches (SMOM) cMs.

Pro Tools gives us a LOT of new information. Not quite segment Triangulation, but very powerful data.

For example, a Match shares 8cM with me and does not have a Tree. However, a SMOM shares 3,489cM with the Match, and Ancestry (with insider info) says the SMOM is the mother of the Match; the SMOM also shares 17cM with me. As it turns out, I know the SMOM is a 3C1R with me on a particular Ancestor couple. It’s easy to 1. add the Match to my Tree; 2. add the Match to my Common Ancestor Spreadsheet; and 3. add a synopsis of this info (as a 3C2R) to the Match’s Notes. Of course this doesn’t happen every time, but it does happen some of the time.

The above example is a parent/child relationship, and Ancestry usually knows if it’s a son or daughter and a mother or father. Ancestry usually knows niece/nephew and aunt/uncle.

But the thrust of this blog post is about a family group and their interrelationships. I’ve tried several methods to document and analyze new Match/SMOM cMs. All methods utilize my Common Ancestor Spreadsheet which is arranged by family groups [I sort by Ahnentafel of the Common Ancestor; and the birth years of children, grandchildren and great grandchildren.] This CA spreadsheet is my foundation of “known” cousins – I’m looking at their Shared Matches to see if I can determine how we are related and add them to the spreadsheet; and checking to see that the existing cousins are interrelated to each other as expected.

First try was to add about 10 blank columns to the spreadsheet. I’d then type an asterisk [*] for a Match in a column, and enter the shared cMs with the other Matches in the spreadsheet in the same column. It was sort of like a Cluster matrix; and anyone who had a faulty genealogy was easily highlighted. But three issues: 1. It was a lot of work for a family group; 2. some of the Matches were in fact related up or down a generation [not physically close on the spreadsheet]; and 3. it was difficult (for me, anyway) to determine how an unknown Match would fit in… [someday I’ll try DNA Painter or BanyanDNA…]

The second try was just one new column, and I would type in the highest cM found among all the Shared Matches; the suggested relationship [almost always accurate for high cMs]; the Match name; and any known info. Issues: again, a lot of work; and some Matches don’t have any high cM SMOM with me. I still add these when they are the only evidence I have for adding a new Match to my Common Ancestor spreadsheet.

Third/current try involves about 3 new columns and I color in a column where Matches match most of the others. Sort of like the Leeds Method’s column-coloring. This is somewhat easier to do, without a lot of typing. And the colored “stripes” are comforting to see (and they highlight Matches who may not “belong” and/or need further research).
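
The coloring rule could be sketched roughly as follows. This is not how I actually do it (I color by eye in the spreadsheet); the function name, the 20cM cutoff and the "more than half" rule are all assumptions chosen for illustration, and the cM values are made up.

```python
def striped_matches(matches, shared_cm, min_cm=20, majority=0.5):
    """Return the Matches that would get a colored cell in one column.

    A Match is colored when it shares at least `min_cm` with more than
    `majority` of the other Matches in the family group. Both thresholds
    are illustrative assumptions.
    """
    striped = set()
    for m in matches:
        others = [o for o in matches if o != m]
        hits = sum(
            1 for o in others
            if shared_cm.get(frozenset((m, o)), 0) >= min_cm
        )
        if others and hits / len(others) > majority:
            striped.add(m)
    return striped


# Made-up pairwise SMOM cMs for one family group of four Matches.
family = ["A", "B", "C", "D"]
cms = {
    frozenset(("A", "B")): 120,
    frozenset(("A", "C")): 95,
    frozenset(("B", "C")): 60,
}
colored = striped_matches(family, cms)
uncolored = set(family) - colored  # candidates for further research
```

Here A, B and C form the stripe, and D is left uncolored as a Match who may not “belong”.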

Also, I’m hopping around some these days, working on specific issues (Brick Walls, questionable genealogy, trying to link in (or out) selected Matches). It appears that the closer generations have one stripy column and as I work on more distant Ancestors, the number of colored columns grows.

I’m still fiddling with good/efficient ways to use/display SMOM cMs; or even if I need to at all. I’ve worked on about 10% of my Matches in the Common Ancestor Spreadsheet. At every turn, Pro Tools is helping me find more and more Matches for whom I can determine our relationship. So still a long way to go – and I’m sure there are many more Matches to add to my spreadsheet.

You are encouraged to post in the comments any insights, tricks or hacks you’ve developed for using SMOM cMs…

[22CS] Segment-ology: Pro Tools 11 – Ways to Analyze SMOM cMs by Jim Bartlett 20241027

Pro Tools Part 10

Posted on August 13, 2024 by Jim Bartlett

Branch Groups

I’m methodically working my way through my Ancestors and Matches using Pro Tools. My main tool is my Common Ancestor Spreadsheet, which is now growing very rapidly. I’m not really in it for the bulk, but for the advantages of Branch Groups. What I call Branch Groups are groups of my DNA Matches under one child or grandchild of one of my Ancestors – these Matches are on the same Tree “branch”. Such Matches are closer to each other (than to me) and tend to share more DNA with each other. They stand out with DNA shares over 90cM; and I take notice. I can often “fit” them into a Branch Group. On the other hand, I’ve found some Matches that have the right genealogy for a Branch Group, but they don’t share much DNA with others in the Group – more on this below.

Here are some thoughts and observations:

SMOM – Shared Matches of Matches aka “Rabbit Holes” – haha. When you select a Match and click on the Shared Matches button – you get a list of all the Matches you both have in your respective Match lists. These are your Shared Matches (SMs) with that Match. Each of these SMs shares some DNA with you that you both got from the same Common Ancestor (CA). And, with Pro Tools, you know how much DNA each of these SMs also shares with the “base” Match that *they* got from some CA. Often these two CAs are the same (or one is ancestral to the other); but sometimes the CAs are completely different (*their* CA could be unrelated to you or related to you on a different line – see Outliers below). When we’ve done our homework and entered Notes for many Matches, we can usually look down the SM list and easily see if there is a consensus, or not – see Birds of a Feather below. Like with auto-Clustering, a consensus indicates a group of Matches that mostly match each other, indicating a Common Ancestor among them. Usually, their CA is also one of your Ancestors – BINGO! This is a Branch Group. Sometimes their CA is unknown to you – this could be a random happenstance. Or it could be a Floating Branch Group – see below.

Branch Group aka Cluster. When you find SMOMs who share high levels of shared DNA (cMs) with each other they usually form a Branch Group. By “high levels” I mean at least 90cM; but I often drop down to around 50cM as the group grows larger. I consider 20-25cM as “in the noise”, and usually not worth the trip down a rabbit hole. [For your own situation, experiment to find a threshold that usually gives you efficient results.] Sometimes you can get 5-10 (or more) of these SMOMs which link under a child or grandchild or great grandchild of one of your Ancestors. And then it’s easier to find other SMOMs that fit into the Branch Group. Use an SMOM in a Branch Group to make a new Shared Match list, invariably with new SMOMs… the clues (or rabbit holes) are everywhere! As it turns out in a Branch Group, not all Match descendants will Match all of the other Matches in the group. Remember: at the 4C level, roughly 50% of true 4C won’t show up as matches to each other.
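
One way to experiment with the threshold is to link any two Matches whose SMOM cM is at or above it, and then collect the linked groups. The sketch below assumes that approach (it is not an Ancestry feature), and the names and cM values are made up. Because members only need a chain of links, a group can hold Matches who do not all match each other, as noted above.

```python
from collections import defaultdict


def branch_groups(pair_cms, threshold=90):
    """Group Matches linked by SMOM cMs at or above `threshold`.

    `pair_cms` maps a (match, match) pair to their shared cM. Members
    of a group are connected through a chain of links; they need not
    all match each other directly.
    """
    graph = defaultdict(set)
    for (a, b), cm in pair_cms.items():
        if cm >= threshold:
            graph[a].add(b)
            graph[b].add(a)

    seen = set()
    groups = []
    for start in sorted(graph):
        if start in seen:
            continue
        group = set()
        stack = [start]
        while stack:
            node = stack.pop()
            if node in seen:
                continue
            seen.add(node)
            group.add(node)
            stack.extend(graph[node] - seen)
        groups.append(group)
    return groups


# Made-up SMOM cMs between pairs of Matches.
pairs = {("A", "B"): 150, ("B", "C"): 95, ("C", "D"): 55, ("E", "F"): 22}

tight = branch_groups(pairs, threshold=90)   # start high
looser = branch_groups(pairs, threshold=50)  # drop down as the group grows
```

With these numbers the 90cM pass groups A, B and C; dropping to 50cM pulls in D, while the 22cM pair stays “in the noise”.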

Birds of a Feather. On many Shared Match lists, a scan of the Notes indicates a clear consensus – most SMs have Notes indicating the same CA; and some are from the same line (up or down a generation). These are birds of a feather – they cluster together. And Pro Tools shows them to be close relatives – these are a Branch Group. In these cases, I’m much more likely to review Matches not yet linked in, and to build their Trees back to find the link. As a quick check, click on a Match and see *their* SMs with you – are they indeed Birds of a Feather? Or not? For some Shared Match lists, a quick scan of existing Notes may indicate they are all over the place – on both sides; on different branches – so, it’s difficult to determine a consensus. Move on…
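
The “scan for a consensus” step could be approximated by counting the CA recorded in each Note. This is a rough sketch: it assumes each Note has already been reduced to a single CA label (or None when there is no Note), and the “more than half” cutoff and the labels are made up. It also ignores Notes that point up or down a generation on the same line, which a human scan would count.

```python
from collections import Counter


def consensus_ca(note_cas, min_share=0.5):
    """Return the CA label named in most Notes, or None if no consensus.

    `note_cas` holds one CA label per Shared Match, with None for
    Matches that have no Note yet. Only Matches with Notes are counted.
    """
    labelled = [ca for ca in note_cas if ca]
    if not labelled:
        return None
    ca, count = Counter(labelled).most_common(1)[0]
    return ca if count / len(labelled) > min_share else None


# Made-up CA labels read from the Notes on two Shared Match lists.
birds_of_a_feather = consensus_ca(["16/17", "16/17", None, "16/17", "20/21"])
all_over_the_place = consensus_ca(["16/17", "20/21", "48/49", "60/61"])
```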

Outliers – linked by genealogy, but not linked by shared DNA. I’ve now run into a very few cases of DNA Matches who are clearly genealogy relatives (in my Common Ancestors spreadsheet) under Ancestor XYZ, but they do not share DNA with other close cousin Matches under XYZ. In each case, so far, they are also related to me in another way, and they do share DNA with their other cousins. Thinking about multiple segments and/or multiple relationships leads me to Triangulated segments, but I’ll put that discussion off for a future blog post. Just be aware that a Match with one shared segment can only be genetically related one way. Pro Tools may help determine which one.

Collateral SURNAMES in Branch Groups. Less than 1% of my Matches have the same SURNAME as the CA we share [Y and mt lines are pretty rare]. This means my Common Ancestor spreadsheet (tracking the lines of descent down to Matches) includes Collateral SURNAMEs. As I’m working on an MRCA Branch Group in my spreadsheet, I’m reviewing each of my Match cousins, and reviewing all of the SMOM shared cMs, and checking the Trees of those over 90cM (and glancing at some down to 50cM). Often there is enough to tie those Matches to my Tree (even some with no Trees). It really helps to review the Collateral SURNAMEs already recorded in my spreadsheet for that Branch – that’s usually where I’m going to find a link. And it means I don’t have to build a tree back for each Match – I can usually copy the line of descent of an existing Match in the spreadsheet, and just change the last few generations. A big time saver – in searching and typing… Recognizing a Collateral SURNAME in a Match’s scrawny Tree is helpful. Sometimes I’ll filter a long Shared Match list by a Collateral SURNAME…

Floating Branch Groups. A few times I’ve found a Branch Group that I cannot link to my Tree. They usually include parent/children, siblings, aunt/uncle/niece/nephew, and maybe some 1C or 2C, all in a tight family group. All the interrelationship cMs are on target. But, other than being on a Shared Match list with some known Matches in a Branch Group, I cannot find a link. In most cases this has happened “near” a Brick Wall (or “iffy”) Ancestor of mine. So I’ve created a Floating Branch in my Tree, so I can link other Matches to it. I need to do a study of closest known Matches to see where this Floating Branch is headed – another rabbit hole. Such a Floating Branch could just be a mirage (not really linked to me), or I might find some “tendril” Matches (maybe through a Collateral SURNAME filter) that help find the link. I operate under the belief that ALL Matches over 15cM (and many under 15cM) are true cousins, and many are within a genealogy timeframe and should fit in my Tree somewhere.

I am now convinced of two things: A) A lot more of our under-20cM Matches are well within our genealogy timeframe than I originally thought; and B) our Brick Walls (out to at least 8C level) have plenty of Matches forming Branch Groups. With each generation going back, it’s harder and harder to figure them out, but Pro Tools can often provide new insights. This helps offset the fact that many Matches have NO Trees or very scrawny Trees. There is hope! But it takes work!

[22CR] Segment-ology: Pro Tools Part 10 Branch Groups by Jim Bartlett 20240812