Identification of alternate polyadenylation sites and analysis of their tissue distribution using EST data
- PMID: 11544195
- PMCID: PMC311108
- DOI: 10.1101/gr.190501
Identification of alternate polyadenylation sites and analysis of their tissue distribution using EST data
Abstract
Alternate polyadenylation affects a large fraction of higher eucaryote mRNAs, producing mature transcripts with 3' ends of variable length. This variation is poorly represented in the current transcript catalogs derived from whole genome sequences, mostly because such posttranscriptional events are not detectable directly at the DNA level. Alternate polyadenylation of an mRNA is better understood by comparison to EST databases. Comparing ESTs to mRNAs, however, is a difficult task subjected to the pitfalls of internal priming, presence of intron sequences, repeated elements, chimerical ESTs or matches with EST from paralogous genes. We present here a computer program that addresses these problems and displays ESTs matches to a query mRNA sequence to predict alternate polyadenylation and to suggest library-specific forms. The output highlights effective polyadenylation signals, possible sources of artifacts such as A-rich stretches in the mRNA sequences, and allows for a direct visualization of EST libraries using color codes. Statistical biases in the distribution of alternative mRNA forms among EST libraries were systematically sought. About 1450 human and 200 mouse mRNAs displayed such biases, suggesting in each case a tissue- or disease-specific regulation of polyadenylation.
Figures
Similar articles
-
In silico analysis of EST and genomic sequences allowed the prediction of cis-regulatory elements for Entamoeba histolytica mRNA polyadenylation.Comput Biol Chem. 2008 Aug;32(4):256-63. doi: 10.1016/j.compbiolchem.2008.03.019. Epub 2008 Apr 12. Comput Biol Chem. 2008. PMID: 18514032
-
Conservation of alternative polyadenylation patterns in mammalian genes.BMC Genomics. 2006 Jul 26;7:189. doi: 10.1186/1471-2164-7-189. BMC Genomics. 2006. PMID: 16872498 Free PMC article.
-
[Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].Yi Chuan Xue Bao. 2004 May;31(5):431-43. Yi Chuan Xue Bao. 2004. PMID: 15478601 Chinese.
-
Identification and characterization of polyadenylation signal (PAS) variants in human genomic sequences based on modified EST clustering.In Silico Biol. 2008;8(3-4):347-61. In Silico Biol. 2008. PMID: 19032167
-
Alternate polyadenylation in human mRNAs: a large-scale analysis by EST clustering.Genome Res. 1998 May;8(5):524-30. doi: 10.1101/gr.8.5.524. Genome Res. 1998. PMID: 9582195
Cited by
-
Tumor suppressor miR-317 and lncRNA Peony are expressed from a polycistronic non-coding RNA locus that regulates germline differentiation and testis morphology.bioRxiv [Preprint]. 2024 Oct 10:2024.10.10.617551. doi: 10.1101/2024.10.10.617551. bioRxiv. 2024. PMID: 39416153 Free PMC article. Preprint.
-
Spatially revealed roles for lncRNAs in Drosophila spermatogenesis, Y chromosome function and evolution.Nat Commun. 2024 May 7;15(1):3806. doi: 10.1038/s41467-024-47346-w. Nat Commun. 2024. PMID: 38714658 Free PMC article.
-
Known sequence features explain half of all human gene ends.NAR Genom Bioinform. 2023 Apr 5;5(2):lqad031. doi: 10.1093/nargab/lqad031. eCollection 2023 Jun. NAR Genom Bioinform. 2023. PMID: 37035540 Free PMC article.
-
Beyond Genes: Inclusion of Alternative Splicing and Alternative Polyadenylation to Assess the Genetic Architecture of Predisposition to Voluntary Alcohol Consumption in Brain of the HXB/BXH Recombinant Inbred Rat Panel.Front Genet. 2022 Mar 15;13:821026. doi: 10.3389/fgene.2022.821026. eCollection 2022. Front Genet. 2022. PMID: 35368676 Free PMC article.
-
Genome annotation with long RNA reads reveals new patterns of gene expression and improves single-cell analyses in an ant brain.BMC Biol. 2021 Nov 27;19(1):254. doi: 10.1186/s12915-021-01188-w. BMC Biol. 2021. PMID: 34838024 Free PMC article.
References
-
- Agresti A. A survey of exact inference for contingency tables. Stat Sci. 1992;7:131–153.
-
- Boguski MS, Lowe TM, Tolstoshev CM. dbEST—database for expressed sequence tags. Nat Genet. 1993;4:332–333. - PubMed
-
- Colgan DF, Manley JL. Mechanism and regulation of mRNA polyadenylation. Genes & Dev. 1997;11:2755–2766. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Research Materials