Identification of consensus patterns in unaligned DNA sequences known to be functionally related
- PMID: 2193692
- DOI: 10.1093/bioinformatics/6.2.81
Identification of consensus patterns in unaligned DNA sequences known to be functionally related
Abstract
We have developed a method for identifying consensus patterns in a set of unaligned DNA sequences known to bind a common protein or to have some other common biochemical function. The method is based on a matrix representation of binding site patterns. Each row of the matrix represents one of the four possible bases, each column represents one of the positions of the binding site and each element is determined by the frequency the indicated base occurs at the indicated position. The goal of the method is to find the most significant matrix--i.e. the one with the lowest probability of occurring by chance--out of all the matrices that can be formed from the set of related sequences. The reliability of the method improves with the number of sequences, while the time required increases only linearly with the number of sequences. To test this method, we analysed 11 DNA sequences containing promoters regulated by the Escherichia coli LexA protein. The matrices we found were consistent with the known consensus sequence, and could distinguish the generally accepted LexA binding sites from other DNA sequences.
Similar articles
-
Identification of common motifs in unaligned DNA sequences: application to Escherichia coli Lrp regulon.Comput Appl Biosci. 1995 Aug;11(4):379-87. doi: 10.1093/bioinformatics/11.4.379. Comput Appl Biosci. 1995. PMID: 8521047
-
Conservation of the LexA repressor binding site in Deinococcus radiodurans.J Integr Bioinform. 2008 Jan 24;5(1). doi: 10.2390/biecoll-jib-2008-86. J Integr Bioinform. 2008. PMID: 20134056
-
Identification of additional genes belonging to the LexA regulon in Escherichia coli.Mol Microbiol. 2000 Mar;35(6):1560-72. doi: 10.1046/j.1365-2958.2000.01826.x. Mol Microbiol. 2000. PMID: 10760155
-
Characterisation of the promoter for the LexA regulated sulA gene of Escherichia coli.Mol Gen Genet. 1983;189(3):400-4. doi: 10.1007/BF00325901. Mol Gen Genet. 1983. PMID: 6306396
-
Consensus sequence Zen.Appl Bioinformatics. 2002;1(3):111-9. Appl Bioinformatics. 2002. PMID: 15130839 Free PMC article. Review.
Cited by
-
Quantitative dissection of transcription in development yields evidence for transcription-factor-driven chromatin accessibility.Elife. 2020 Oct 19;9:e56429. doi: 10.7554/eLife.56429. Elife. 2020. PMID: 33074101 Free PMC article.
-
Comparison between Timelines of Transcriptional Regulation in Mammals, Birds, and Teleost Fish Somitogenesis.PLoS One. 2016 May 18;11(5):e0155802. doi: 10.1371/journal.pone.0155802. eCollection 2016. PLoS One. 2016. PMID: 27192554 Free PMC article.
-
Identification of cis-regulatory modules in promoters of human genes exploiting mutual positioning of transcription factors.Nucleic Acids Res. 2013 Oct;41(19):8822-41. doi: 10.1093/nar/gkt578. Epub 2013 Aug 2. Nucleic Acids Res. 2013. PMID: 23913413 Free PMC article.
-
Sigma-2: Multiple sequence alignment of non-coding DNA via an evolutionary model.BMC Bioinformatics. 2010 Sep 16;11:464. doi: 10.1186/1471-2105-11-464. BMC Bioinformatics. 2010. PMID: 20846408 Free PMC article.
-
Genome-wide identification of alternatively spliced mRNA _targets of specific RNA-binding proteins.PLoS One. 2007 Jun 13;2(6):e520. doi: 10.1371/journal.pone.0000520. PLoS One. 2007. PMID: 17565373 Free PMC article.