Allele Frequency Graphs by Y Haplogroup
In January of 2004 I collected together every FTDNA haplotype I could find on the Internet. (Much to my chagrin, this was before FTDNA came out with the 37 marker extension - so I have no data on the new markers).
There is an inherent problem with this data in that it all came from surname studies. This means it is not anywhere near a randomly sampled dataset such as is normally used in genetics studies. The best I could do to minimize the biases due to individual surname studies was to delete from the dataset any haplotypes that appeared more than once in each surname study. These haplotypes came from over 200 different surname studies - so we can hope that there was enough "averaging" over all those studies to give us something close to what we'd find in a true randomly sampled dataset of unrelated men. After "same webpage" duplicates were subtracted out, I had 2500+ twentyfive marker haplotypes. The allele frequency graphs available below were put together using that dataset. Haplogroups were estimated using methods similar to those given on my homepage.
|
to R1b Modal Haplotype |
(with DYS455=8) |
| |
|
My results have numbers for four DYS464 markers. How come there is
only one DYS464 graph?
DYS464 is an example of what is called a "multi-copy" marker. "Multi-copy" means that the marker exists at
more than one place along the Y chromosome. When the lab tests that marker, it ends up getting back a different
answer from each of the copies - with no way of knowing from which position each answer came. What the lab does
instead is just list all four answers in numerical order - from smallest to largest - and call them 464a, 464b, 464c, and 464d.
So the number of repeats that your results give for DYS464c isn't the pattern seen at a given place "c" on the Y
chromosome, it is just the second largest of your four DYS464 results. Since we can't really keep track of
what repeats came from which place, it doesn't make sense to treat them individually. All we can
really talk about is which numbers of repeats are most commonly seen among the four DYS464 alleles.
Therefore only one graph is given for DYS464, and the total height at each allele is given as the sum of the
frequencies of that allele among all four DYS464 results. The graph should be interpreted as showing
which alleles a
person in that haplogroup is likely to see among his four results for DYS464. If a given allele shows a
frequency of, say "1.9", this means that members of that haplogroup will commonly have that allele
appearing twice in their list of DYS464 results. The other multicopy markers, DYS385, and DYS459 are
treated similarly.