Statistical significance of cis-regulatory modules
- PMID: 17241466
- PMCID: PMC1796902
- DOI: 10.1186/1471-2105-8-19
Statistical significance of cis-regulatory modules
Abstract
Background: It is becoming increasingly important for researchers to be able to scan through large genomic regions for transcription factor binding sites or clusters of binding sites forming cis-regulatory modules. Correspondingly, there has been a push to develop algorithms for the rapid detection and assessment of cis-regulatory modules. While various algorithms for this purpose have been introduced, most are not well suited for rapid, genome scale scanning.
Results: We introduce methods designed for the detection and statistical evaluation of cis-regulatory modules, modeled as either clusters of individual binding sites or as combinations of sites with constrained organization. In order to determine the statistical significance of module sites, we first need a method to determine the statistical significance of single transcription factor binding site matches. We introduce a straightforward method of estimating the statistical significance of single site matches using a database of known promoters to produce data structures that can be used to estimate p-values for binding site matches. We next introduce a technique to calculate the statistical significance of the arrangement of binding sites within a module using a max-gap model. If the module scanned for has defined organizational parameters, the probability of the module is corrected to account for organizational constraints. The statistical significance of single site matches and the architecture of sites within the module can be combined to provide an overall estimation of statistical significance of cis-regulatory module sites.
Conclusion: The methods introduced in this paper allow for the detection and statistical evaluation of single transcription factor binding sites and cis-regulatory modules. The features described are implemented in the Search Tool for Occurrences of Regulatory Motifs (STORM) and MODSTORM software.
Figures
Similar articles
-
Searching for statistically significant regulatory modules.Bioinformatics. 2003 Oct;19 Suppl 2:ii16-25. doi: 10.1093/bioinformatics/btg1054. Bioinformatics. 2003. PMID: 14534166
-
Computational detection of cis -regulatory modules.Bioinformatics. 2003 Oct;19 Suppl 2:ii5-14. doi: 10.1093/bioinformatics/btg1052. Bioinformatics. 2003. PMID: 14534164
-
Recognition of cis-regulatory elements with vombat.J Bioinform Comput Biol. 2007 Apr;5(2B):561-77. doi: 10.1142/s0219720007002886. J Bioinform Comput Biol. 2007. PMID: 17636862
-
Finding regulatory elements and regulatory motifs: a general probabilistic framework.BMC Bioinformatics. 2007 Sep 27;8 Suppl 6(Suppl 6):S4. doi: 10.1186/1471-2105-8-S6-S4. BMC Bioinformatics. 2007. PMID: 17903285 Free PMC article. Review.
-
Computational approaches to finding and analyzing cis-regulatory elements.Methods Cell Biol. 2008;87:337-65. doi: 10.1016/S0091-679X(08)00218-5. Methods Cell Biol. 2008. PMID: 18485306 Review.
Cited by
-
SETDB1 regulates short interspersed nuclear elements and chromatin loop organization in mouse neural precursor cells.Genome Biol. 2024 Jul 3;25(1):175. doi: 10.1186/s13059-024-03327-2. Genome Biol. 2024. PMID: 38961490 Free PMC article.
-
Gene set-based module discovery in the breast cancer transcriptome.BMC Bioinformatics. 2009 Feb 26;10:71. doi: 10.1186/1471-2105-10-71. BMC Bioinformatics. 2009. PMID: 19243633 Free PMC article.
-
Ubiquitin accumulation in autophagy-deficient mice is dependent on the Nrf2-mediated stress response pathway: a potential role for protein aggregation in autophagic substrate selection.J Cell Biol. 2010 Nov 1;191(3):537-52. doi: 10.1083/jcb.201005012. J Cell Biol. 2010. PMID: 21041446 Free PMC article.
-
ChIP-Seq analysis identifies p27(Kip1)-_target genes involved in cell adhesion and cell signalling in mouse embryonic fibroblasts.PLoS One. 2017 Nov 20;12(11):e0187891. doi: 10.1371/journal.pone.0187891. eCollection 2017. PLoS One. 2017. PMID: 29155860 Free PMC article.
-
Prediction of CTCF loop anchor based on machine learning.Front Genet. 2023 Apr 3;14:1181956. doi: 10.3389/fgene.2023.1181956. eCollection 2023. Front Genet. 2023. PMID: 37077544 Free PMC article.
References
-
- Leighton P, Saam J, Ingram R, Stewart C, Tilghman S. An enhancer deletion affects both H19 and Igf2 expression. Genes Dev. 1995;9:2079–2089. - PubMed
-
- Staden R. Methods for calculating the probabilities of finding patterns in sequences. Computer Applications in the Biosciences. 1989;5:89–96. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous