Annotation of functional variation in personal genomes using RegulomeDB
- PMID: 22955989
- PMCID: PMC3431494
- DOI: 10.1101/gr.137323.112
Abstract
As the sequencing of healthy and disease genomes becomes more commonplace, detailed annotation provides interpretation for individual variation responsible for normal and disease phenotypes. Current approaches focus on direct changes in protein-coding genes, particularly nonsynonymous mutations that directly affect the gene product. However, most individual variation occurs outside of genes and, indeed, most markers generated from genome-wide association studies (GWAS) identify variants outside of coding segments. Identification of potential regulatory changes that perturb these sites will lead to a better localization of truly functional variants and interpretation of their effects. We have developed a novel approach and database, RegulomeDB, which guides interpretation of regulatory variants in the human genome. RegulomeDB includes high-throughput, experimental data sets from ENCODE and other sources, as well as computational predictions and manual annotations to identify putative regulatory potential and identify functional variants. These data sources are combined into a powerful tool that scores variants to help separate functional variants from a large pool and provides a small set of putative sites with testable hypotheses as to their function. We demonstrate the applicability of this tool to the annotation of noncoding variants from 69 fully sequenced genomes as well as that of a personal genome, where thousands of functionally associated variants were identified. Moreover, we demonstrate a GWAS where the database is able to quickly identify the known associated functional variant and provide a hypothesis as to its function. Overall, we expect this approach and resource to be valuable for the annotation of human genome sequences.