The GENCODE v7 catalog of human long noncoding RNAs: analysis of their gene structure, evolution, and expression
- PMID: 22955988
- PMCID: PMC3431493
- DOI: 10.1101/gr.132159.111
Abstract
The human genome contains many thousands of long noncoding RNAs (lncRNAs). While several studies have demonstrated compelling biological and disease roles for individual examples, analytical and experimental approaches to investigate these genes have been hampered by the lack of comprehensive lncRNA annotation. Here, we present and analyze the most complete human lncRNA annotation to date, produced by the GENCODE consortium within the framework of the ENCODE project and comprising 9277 manually annotated genes producing 14,880 transcripts. Our analyses indicate that lncRNAs are generated through pathways similar to those of protein-coding genes, with similar histone-modification profiles, splicing signals, and exon/intron lengths. In contrast to protein-coding genes, however, lncRNAs display a striking bias toward two-exon transcripts, they are predominantly localized in the chromatin and nucleus, and a fraction appear to be preferentially processed into small RNAs. They are under stronger selective pressure than neutrally evolving sequences, particularly in their promoter regions, which display levels of selection comparable to those of protein-coding genes. Importantly, about one-third seem to have arisen within the primate lineage. Comprehensive analysis of their expression in multiple human organs and brain regions shows that lncRNAs are generally expressed at lower levels than protein-coding genes and display more tissue-specific expression patterns, with a large fraction of tissue-specific lncRNAs expressed in the brain. Expression correlation analysis indicates that lncRNAs show particularly striking positive correlation with the expression of antisense coding genes. This GENCODE annotation represents a valuable resource for future studies of lncRNAs.