Statistical properties of the number of recombination events in the history of a sample of DNA sequences
- PMID: 4029609
- PMCID: PMC1202594
- DOI: 10.1093/genetics/111.1.147
Abstract
Some statistical properties of samples of DNA sequences are studied under an infinite-site neutral model with recombination. The two quantities of interest are R, the number of recombination events in the history of a sample of sequences, and RM, the number of recombination events that can be parsimoniously inferred from a sample of sequences. Formulas are derived for the mean and variance of R. In contrast to R, RM can be determined from the sample. Since no formulas are known for the mean and variance of RM, they are estimated with Monte Carlo simulations. It is found that RM is often much less than R; therefore, the number of recombination events may be greatly underestimated in a parsimonious reconstruction of the history of a sample. The statistic RM can be used to estimate the product of the recombination rate and the population size or, if the recombination rate is known, to estimate the population size. To illustrate this, DNA sequences from the Adh region of Drosophila melanogaster are used to estimate the effective population size of this species.
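The abstract does not spell out how RM is computed from a sample. A minimal sketch follows, assuming the four-gamete approach commonly associated with this statistic: under the infinite-site model, a pair of sites showing all four allele combinations implies at least one recombination event between them, and RM is taken as the largest number of such site pairs whose intervals do not overlap. The function names `four_gametes` and `min_recombinations`, and the encoding of sequences as equal-length strings of biallelic characters, are assumptions made for illustration.

```python
from itertools import combinations


def four_gametes(seqs, i, j):
    """Return True if sites i and j show all four allele combinations.

    Assumes each site is biallelic, as under the infinite-site model.
    """
    return len({(s[i], s[j]) for s in seqs}) == 4


def min_recombinations(seqs):
    """Count the maximum number of non-overlapping incompatible intervals.

    seqs: list of equal-length strings, one per sampled sequence.
    """
    n_sites = len(seqs[0])
    # Every pair of sites that fails the four-gamete test marks an interval
    # that must contain at least one recombination event.
    intervals = [
        (i, j)
        for i, j in combinations(range(n_sites), 2)
        if four_gametes(seqs, i, j)
    ]
    # Greedy selection by right endpoint yields the largest set of
    # intervals that share no interior (touching endpoints are allowed).
    intervals.sort(key=lambda iv: iv[1])
    count = 0
    last_right = -1
    for left, right in intervals:
        if left >= last_right:
            count += 1
            last_right = right
    return count
```

For example, `min_recombinations(["000", "011", "101", "110"])` finds incompatibilities between sites 0 and 1 and between sites 1 and 2, so at least two events are inferred. Because only events that leave such a signature are counted, this kind of bound can fall well below the true number R, which is the point the abstract makes.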
Similar articles
- On the frequency of undetectable recombination events. Genetics. 1986 Apr;112(4):923-6. doi: 10.1093/genetics/112.4.923. PMID: 3957012. Free PMC article.
- Estimating effective population size or mutation rate using the frequencies of mutations of various classes in a sample of DNA sequences. Genetics. 1994 Dec;138(4):1375-86. doi: 10.1093/genetics/138.4.1375. PMID: 7896116. Free PMC article.
- The coalescent process in models with selection and recombination. Genetics. 1988 Nov;120(3):831-40. doi: 10.1093/genetics/120.3.831. PMID: 3147214. Free PMC article.
- Structure and function of the shufflon in plasmid R64. Adv Biophys. 2004;38:183-213. PMID: 15493334. Review.
- Are you my mother? Bayesian phylogenetic inference of recombination among putative parental strains. Appl Bioinformatics. 2003;2(3):131-44. PMID: 15130798. Review.
Cited by
- Characterization of genetic diversity and population structure within Staphylococcus chromogenes by multilocus sequence typing. PLoS One. 2021 Mar 15;16(3):e0243688. doi: 10.1371/journal.pone.0243688. eCollection 2021. PMID: 33720932. Free PMC article.
- Failing the four-gamete test enables exact phasing: the Corners' Algorithm. Genet Sel Evol. 2022 Nov 14;54(1):74. doi: 10.1186/s12711-022-00763-1. PMID: 36376786. Free PMC article.
- Population structure and demographic history of a tropical lowland rainforest tree species Shorea parvifolia (Dipterocarpaceae) from Southeastern Asia. Ecol Evol. 2012 Jul;2(7):1663-75. doi: 10.1002/ece3.284. PMID: 22957170. Free PMC article.
- Global spread and genetic variants of the two CYP9M10 haplotype forms associated with insecticide resistance in Culex quinquefasciatus Say. Heredity (Edinb). 2013 Sep;111(3):216-26. doi: 10.1038/hdy.2013.40. Epub 2013 May 1. PMID: 23632895. Free PMC article.
- A genetic polymorphism evolving in parallel in two cell compartments and in two clades. BMC Evol Biol. 2013 Jan 12;13:9. doi: 10.1186/1471-2148-13-9. PMID: 23311980. Free PMC article.