The sampling distribution of linkage disequilibrium under an infinite allele model without selection
- PMID: 3979817
- PMCID: PMC1216291
- DOI: 10.1093/genetics/109.3.611
The sampling distribution of linkage disequilibrium under an infinite allele model without selection
Abstract
The sampling distributions of several statistics that measure the association of alleles on gametes (linkage disequilibrium) are estimated under a two-locus neutral infinite allele model using an efficient Monte Carlo method. An often used approximation for the mean squared linkage disequilibrium is shown to be inaccurate unless the proper statistical conditioning is used. The joint distribution of linkage disequilibrium and the allele frequencies in the sample is studied. This estimated joint distribution is sufficient for obtaining an approximate maximum likelihood estimate of C = 4Nc, where N is the population size and c is the recombination rate. It has been suggested that observations of high linkage disequilibrium might be a good basis for rejecting a neutral model in favor of a model in which natural selection maintains genetic variation. It is found that a single sample of chromosomes, examined at two loci cannot provide sufficient information for such a test if C less than 10, because with C this small, very high levels of linkage disequilibrium are not unexpected under the neutral model. In samples of size 50, it is found that, even when C is as large as 50, the distribution of linkage disequilibrium conditional on the allele frequencies is substantially different from the distribution when there is no linkage between the loci. When conditioned on the number of alleles at each locus in the sample, all of the sample statistics examined are nearly independent of theta = 4N mu, where mu is the neutral mutation rate.
Similar articles
-
The sampling distribution of linkage disequilibrium.Genetics. 1984 Sep;108(1):257-74. doi: 10.1093/genetics/108.1.257. Genetics. 1984. PMID: 6479585 Free PMC article.
-
Variance in estimated pairwise genetic distance under high versus low coverage sequencing: The contribution of linkage disequilibrium.Theor Popul Biol. 2017 Oct;117:51-63. doi: 10.1016/j.tpb.2017.08.001. Epub 2017 Aug 24. Theor Popul Biol. 2017. PMID: 28842178
-
Approximating genealogies for partially linked neutral loci under a selective sweep.J Math Biol. 2007 Sep;55(3):299-330. doi: 10.1007/s00285-007-0085-7. Epub 2007 Mar 30. J Math Biol. 2007. PMID: 17396267
-
On selecting markers for association studies: patterns of linkage disequilibrium between two and three diallelic loci.Genet Epidemiol. 2003 Jan;24(1):57-67. doi: 10.1002/gepi.10217. Genet Epidemiol. 2003. PMID: 12508256 Review.
-
Statistical genetics in and out of quasi-linkage equilibrium.Rep Prog Phys. 2023 Apr 4;86(5). doi: 10.1088/1361-6633/acc5fa. Rep Prog Phys. 2023. PMID: 36944245 Review.
Cited by
-
Selection, hitchhiking and disequilibrium analysis at three linked loci with application to HLA data.Genetics. 1991 Nov;129(3):931-48. doi: 10.1093/genetics/129.3.931. Genetics. 1991. PMID: 1752429 Free PMC article.
-
Polymorphic DNA haplotypes at the human phenylalanine hydroxylase locus and their relationship with phenylketonuria.Hum Genet. 1987 May;76(1):40-6. doi: 10.1007/BF00283048. Hum Genet. 1987. PMID: 2883110
-
Casein haplotype diversity in seven dairy goat breeds.Arch Anim Breed. 2019 Jul 24;62(2):447-454. doi: 10.5194/aab-62-447-2019. eCollection 2019. Arch Anim Breed. 2019. PMID: 31807656 Free PMC article.
-
Estimating linkage disequilibrium from genotypes under Hardy-Weinberg equilibrium.BMC Genet. 2020 Feb 26;21(1):21. doi: 10.1186/s12863-020-0818-9. BMC Genet. 2020. PMID: 32102657 Free PMC article.
-
Linkage disequilibrium predicts physical distance in the adenomatous polyposis coli region.Am J Hum Genet. 1994 May;54(5):884-98. Am J Hum Genet. 1994. PMID: 8178829 Free PMC article.
References
MeSH terms
LinkOut - more resources
Full Text Sources
Research Materials