RADAR: a rigorously annotated database of A-to-I RNA editing

Abstract

We present RADAR—a rigorously annotated database of A-to-I RNA editing (available at http://RNAedit.com). The identification of A-to-I RNA editing sites has been dramatically accelerated in the past few years by high-throughput RNA sequencing studies. RADAR includes a comprehensive collection of A-to-I RNA editing sites identified in humans (Homo sapiens), mice (Mus musculus) and flies (Drosophila melanogaster), together with extensive manually curated annotations for each editing site. RADAR also includes an expandable listing of tissue-specific editing levels for each editing site, which will facilitate the assignment of biological functions to specific editing sites.

INTRODUCTION

RNA editing is the post- or co-transcriptional modification of RNA nucleotides from their genome-encoded sequence. The most common type of editing in metazoans is the deamination of adenosine into inosine (A-to-I) catalyzed by the adenosine deaminase acting on RNA (ADAR) family of enzymes (1). ADAR enzymes bind double-stranded regions of RNA molecules and deaminate adenosine into inosine, which is subsequently recognized as guanosine by the cellular machinery. ADARs perform critical functions in the nervous system (2), and knockout of ADARs in mice causes lethality (1).

Historically, the identification of A-to-I editing sites has been dependent on the sequencing technologies available at the time. When DNA sequencing technologies were first being developed and automated, the identification of editing sites was slow and often occurred serendipitously. The development and growth of nucleotide databases facilitated the identification of additional editing sites. In recent years, the advent of high-throughput RNA sequencing (RNA-seq) has enabled transcriptome-wide identification of RNA editing sites and has greatly accelerated the discovery of A-to-I editing sites.

The major challenges in the field are to understand how RNA editing is regulated and to assign biological functions to specific editing sites. Currently, a widely used database of A-to-I editing sites is the database of RNA editing (DARNED) (http://darned.ucc.ie) (3). Although DARNED is a centralized repository for the location of A-to-I editing sites in the transcriptome, it contains few manually curated annotations and no information about the dynamic regulation of editing sites. RNA editing is tightly regulated in a spatiotemporal manner (4), and to elucidate the function of a particular editing site, it will be vital to analyze tissue-specific editing levels. We designed a rigorously annotated database of A-to-I RNA editing (RADAR) with this goal in mind. First and foremost, RADAR is an updated repository of A-to-I editing sites in humans, mice and flies. We included detailed manually curated annotations for each editing site as described later (see Database Features). In addition, for each editing site, we included a catalog of tissue-specific editing levels from published RNA-seq datasets. As further RNA-seq studies are published, the number of identified editing sites as well as the catalog of tissue-specific editing levels will be continuously updated to facilitate a deeper understanding of how RNA editing is dynamically regulated.

Data collection

We collected a list of A-to-I editing sites in humans, mice and flies after performing a literature search. The first mammalian A-to-I editing sites were identified as amino acid recoding modifications in glutamate and serotonin receptors in the nervous system (5–7). As nucleotide sequences began to be deposited in expressed sequence tag (EST) databases, these resources were mined to identify additional A-to-I editing sites, focusing on editing events that changed amino acid sequences (8–12). EST database mining also demonstrated that A-to-I editing is quite prevalent in human Alu repeats (13,14). Additionally, a biochemical method to identify inosine in RNA molecules was developed by Sakurai et al. (15) and used to identify ∼5000 editing sites.

The vast majority of A-to-I editing sites have been identified in the past 2 years using high-throughput RNA-seq technologies. In humans, we first applied high-throughput sequencing to study A-to-I RNA editing by using a combination of targeted capture with padlock probes and high-throughput sequencing to identify several hundred editing sites (16). This success was followed by efforts to identify RNA editing sites in an unbiased transcriptome-wide manner by comparing sequence differences between matched RNA and DNA sequencing of a single individual. The first of these efforts (17) was controversial in that it claimed to provide evidence to support RNA editing of all 12 possible mismatch types, but further analyses (18–22) demonstrated that these non-canonical editing mismatches were false positives. Subsequent studies by us and others (23–26) developed meticulous computational pipelines to accurately identify A-to-I editing sites from matched RNA and DNA sequencing of human cell lines while minimizing technical artifacts from sequencing or read mapping errors. More recently, we developed a method to identify RNA editing sites using RNA-seq data alone by comparing transcriptome variants between different individuals (27). We used this method to identify A-to-I editing sites using RNA-seq data from human primary tissues whose genome sequencing data were not available (27). In total, at the time of first release, RADAR contains information describing 1 379 403 human A-to-I RNA editing sites.
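Because inosine is read as guanosine, an A-to-I editing site appears as an A-to-G mismatch between genomic DNA and RNA on the transcript strand, which corresponds to T-to-C on the genome's plus strand for minus-strand genes. The following is a minimal sketch of that strand-aware classification only; it is not the published pipelines (23–26), which apply many additional filters, and the function names and example sites are hypothetical.

```python
# Complement table used to express minus-strand mismatches in transcript terms.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def mismatch_type(dna_base, rna_base, strand):
    """Return the mismatch as seen on the transcript, e.g. 'A>G'.

    dna_base and rna_base are given on the genome's plus strand;
    strand is '+' or '-' for the gene that the site falls in.
    """
    if strand == "-":
        dna_base, rna_base = COMPLEMENT[dna_base], COMPLEMENT[rna_base]
    return f"{dna_base}>{rna_base}"


def is_candidate_a_to_i(dna_base, rna_base, strand):
    """A-to-I editing appears as an A-to-G mismatch on the transcript strand."""
    return mismatch_type(dna_base, rna_base, strand) == "A>G"


# Hypothetical sites: (genomic base, base seen in RNA reads, gene strand).
sites = [("A", "G", "+"), ("T", "C", "-"), ("C", "T", "+"), ("T", "C", "+")]
candidates = [s for s in sites if is_candidate_a_to_i(*s)]
```

Only the first two hypothetical sites pass: the third is a C-to-T mismatch, and the fourth is T-to-C on a plus-strand gene, which is not an A-to-G change on the transcript.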

In mice, Neeman et al. (28) identified clustered RNA editing sites from EST databases, and Danecek et al. (29) identified RNA editing sites using matched RNA and DNA sequencing data from brain tissues of 15 inbred mouse lines. In flies, Graveley et al. (30) identified RNA editing sites using RNA sequencing data from the modENCODE consortium, Rodriguez et al. (31) identified RNA editing sites using sequencing of nascent RNA transcripts and we (27) identified RNA editing sites using a comparative transcriptome method between three different Drosophila species. In total, at the time of first release, RADAR contains information describing 8108 mouse and 2698 fly A-to-I RNA editing sites.

Database features

The genomic coordinates for all editing sites were first mapped onto the latest genome assemblies (human–hg19, mouse–mm9 and fly–dm3) using the liftOver tool from the University of California, Santa Cruz (UCSC) genome browser (32). For each editing site, we manually curated annotations, which consist of the genome assembly strand, associated gene, functional region within the gene (coding sequence, untranslated region, intron), associated repetitive element, conservation of editing to other species and the reference study in which the site was first identified.
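Coordinate conversion with liftOver typically starts from a BED file. As a minimal sketch, assuming site positions are stored as 1-based coordinates (the text does not state the convention), a site can be written as a six-column BED line as follows. The helper name `to_bed_line` and the example site are hypothetical.

```python
def to_bed_line(chromosome, position, strand, name):
    """Format a 1-based site as a six-column BED line (0-based, half-open)."""
    start = position - 1  # BED start is 0-based
    end = position        # BED end is exclusive, so one base spans [pos-1, pos)
    return "\t".join([chromosome, str(start), str(end), name, "0", strand])


# Placeholder site, not a real RADAR entry.
line = to_bed_line("chr1", 1000, "+", "site_1")
# A file of such lines could then be passed to the liftOver command-line tool
# together with a chain file for the source and target assemblies.
```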

We designed a user-friendly web interface to query the database. The search page is displayed in Figure 1. Users must choose a species (human, mouse or fly) to search within. Users can filter their desired search using any combination of the listed annotations consisting of location in genome, gene, genic location (non-synonymous, synonymous, 5′-UTR, 3′-UTR, non-coding RNA, intronic, intergenic), repetitive element (Alu, repetitive non-Alu, nonrepetitive) and editing conservation (chimpanzee, rhesus and/or mouse for human editing sites and human for mouse editing sites). To facilitate more detailed searches, we have made the entire database contents available as flat files on the Download web page.
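The same kind of combined filtering can be applied offline to the downloadable flat files. The sketch below assumes a tab-delimited file with a header row; the column names and the example rows are placeholders chosen for illustration and may not match the actual file layout.

```python
import csv
import io

# Placeholder rows; column names and values are illustrative only.
FLAT_FILE = """chromosome\tposition\tgene\tstrand\tgenic_region\trepetitive_element\tconservation
chrX\t1000\tGENE_A\t+\tnon-synonymous\tnonrepetitive\tmouse
chr1\t2000\tGENE_B\t-\t3'-UTR\tAlu\t
chr1\t3000\tGENE_B\t-\tintronic\tAlu\t
"""


def filter_sites(handle, **criteria):
    """Yield rows whose columns equal every given criterion."""
    reader = csv.DictReader(handle, delimiter="\t")
    for row in reader:
        if all(row[column] == value for column, value in criteria.items()):
            yield row


# Combine annotations, as on the search page: gene plus repetitive element.
alu_sites = list(
    filter_sites(io.StringIO(FLAT_FILE), gene="GENE_B", repetitive_element="Alu")
)
# Or filter on a single annotation such as the genic location.
recoding = list(filter_sites(io.StringIO(FLAT_FILE), genic_region="non-synonymous"))
```

For a real download, `io.StringIO(FLAT_FILE)` would be replaced by an open file handle, and the keyword names would need to match that file's header.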

Figure 1.

RADAR search page. Users can search for A-to-I editing sites in humans, mice or flies by any combination of the provided annotations: genomic location, gene, genic location, repetitive element overlap and editing conservation.


An example results page is displayed in Figure 2. The search parameters are repeated across the top of the page. Information about each editing site is displayed in a single row consisting of nine columns: chromosome, position, gene, strand, genic region, repetitive element, conservation, reference and editing levels. Clicking on the ‘position’ column will direct the user to this location in UCSC genome browser displaying the overlapping gene annotations, genomic nucleotide conservation, overlapping SNP database entries and overlapping repetitive elements. Clicking on an organism under the conservation column will direct the user to the UCSC genome browser location of the conserved editing site in the selected organism. Clicking on the reference column will direct the user to the PubMed abstract for the selected study. Users can download their search results as a tab-delimited text file by clicking on the ‘Download results’ button. A more detailed explanation of the results page can be found on the Tutorial web page.

Figure 2.

Example of a RADAR search result. A search of human non-synonymous editing sites in the HTR2C gene is displayed. Hyperlinks exist in the following four columns: position, conservation, reference and editing levels. (1) Clicking on the position column will direct the user to the location of the editing site in the UCSC browser. (2) Clicking on a species name in the conservation column will direct the user to the location of the conserved editing site in the UCSC browser. (3) Clicking on the reference column will direct the user to the PubMed abstract for the study that identified the editing site. (4) Clicking on the editing level column will direct the user to tissue-specific editing level measurements for the editing site.


Tissue-specific editing levels from RNA-seq data (23,25–27,29–31) are available by clicking on the ‘link’ in the ‘editing levels’ column. The information from a single experiment is displayed in each row, which consists of four columns: link to the PubMed abstract for that study, tissue studied, sequencing coverage and editing level. At the time of first release, RADAR contains 1 343 464 human, 7272 mouse and 3155 fly tissue-specific editing level measurements of 975 734 human, 7272 mouse and 2698 fly editing sites, respectively.
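For orientation, an editing level is commonly computed as the fraction of reads covering a site that carry G rather than A. The sketch below assumes that definition and takes coverage to be the number of reads showing A or G at the site; the individual studies may define both quantities differently. The read counts are hypothetical.

```python
def editing_level(a_reads, g_reads):
    """Return (coverage, level) for one site in one sample.

    Coverage is taken here as the number of reads showing A or G at the site,
    and the level as the G fraction; the level is None when coverage is zero.
    """
    coverage = a_reads + g_reads
    level = g_reads / coverage if coverage else None
    return coverage, level


# Hypothetical (A reads, G reads) for one site across tissues.
counts = {"brain": (30, 70), "liver": (95, 5), "muscle": (0, 0)}
levels = {tissue: editing_level(a, g) for tissue, (a, g) in counts.items()}
```

Reporting coverage alongside the level matters because a level estimated from a handful of reads is far less reliable than one estimated from hundreds.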

Database architecture and web interface

RADAR was built using the Django web framework coupled with a backend MySQL database. The web page was published using an Apache server hosted by Amazon Web Services. RADAR is freely accessible at http://RNAedit.com.
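The relationship between editing sites and their tissue-specific measurements can be pictured as two linked tables. The sketch below is not the RADAR schema: it uses Python's built-in sqlite3 module as a stand-in for the Django and MySQL stack, and all table names, column names and rows are hypothetical, loosely following the columns described for the results pages.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE editing_site (
    id INTEGER PRIMARY KEY,
    species TEXT NOT NULL,
    chromosome TEXT NOT NULL,
    position INTEGER NOT NULL,
    strand TEXT NOT NULL,
    gene TEXT,
    genic_region TEXT,
    repetitive_element TEXT,
    UNIQUE (species, chromosome, position, strand)
);
CREATE TABLE editing_level (
    site_id INTEGER NOT NULL REFERENCES editing_site(id),
    study TEXT NOT NULL,
    tissue TEXT NOT NULL,
    coverage INTEGER NOT NULL,
    level REAL NOT NULL
);
""")

# Placeholder rows, not real RADAR entries.
conn.execute(
    "INSERT INTO editing_site VALUES "
    "(1, 'human', 'chrX', 1000, '+', 'GENE_A', 'non-synonymous', 'nonrepetitive')"
)
conn.executemany(
    "INSERT INTO editing_level VALUES (?, ?, ?, ?, ?)",
    [(1, "study_1", "brain", 100, 0.7), (1, "study_1", "liver", 100, 0.05)],
)

# One row per measurement for the sites of a gene, highest level first.
rows = conn.execute(
    """
    SELECT s.gene, l.tissue, l.level
    FROM editing_site AS s
    JOIN editing_level AS l ON l.site_id = s.id
    WHERE s.species = ? AND s.gene = ?
    ORDER BY l.level DESC
    """,
    ("human", "GENE_A"),
).fetchall()
```

Keeping measurements in a separate table lets new RNA-seq studies be added as rows without changing the site records.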

DISCUSSION AND FUTURE DIRECTIONS

The recent boom in A-to-I editing site identification has necessitated the development of RNA editing databases to help elucidate the biological functions of specific editing sites. The major advantages of RADAR over DARNED are the comprehensive compilation of A-to-I editing sites, the curation of extensive annotations and the gathering of tissue-specific editing level measurements for each editing site. RADAR contains ∼1.4 million human editing sites, which is a substantial increase over the ∼600 000 editing sites in DARNED. Furthermore, RADAR allows users to search for specific subsets of editing sites using any combination of five annotations: genomic location, gene, genic location, repetitive elements and/or editing conservation, whereas DARNED searches are restricted to sequence context or any combination of three annotations: genomic location, gene and genic location. Finally, the catalog of tissue-specific editing levels will help shed light on which biological contexts each editing site may be involved in. The major advantages of DARNED over RADAR are implementation of sequence-based searches, dbSNP identifiers and links to Wikipedia annotations. We are open to implementing similar features in RADAR if so requested by users.

We anticipate that the continued development of high-throughput sequencing technologies will result in numerous new investigations into A-to-I editing in various physiological and pathological contexts. Recent evidence has already linked dysfunction of A-to-I editing with a myriad of human diseases such as cancer (33) and autoimmune disorders (34). As more data are generated and included, RADAR will provide a centralized repository providing information on the locations and dynamic regulation of A-to-I editing sites in the transcriptome of metazoans.

FUNDING

Stanford Genome Training Program and Stanford Graduate Fellowship (to G.R.). The U.S. National Institutes of Health [GM102484 to J.B.L.]. Funding for open access charge: National Institutes of Health.

Conflict of interest statement. None declared.

ACKNOWLEDGEMENTS

The authors thank Jung-Ki Yoon for assistance with data collection and Tricia Deng for assistance with web page styling. They are grateful to colleagues in the RNA editing community and members of the Li Lab for helpful suggestions.

REFERENCES

1. Nishikura. Functions and regulation of RNA editing by ADAR deaminases. Ann. Rev. Biochem., 2010 (pg. 321–349).

2. Rosenthal, Seeburg. A-to-I RNA editing: effects on proteins key to neural excitability. Neuron, 2012 (pg. 432–439).

3. Kiran, O'Mahony, Sanjeev, Baranov. Darned in 2013: inclusion of model organisms and linking with Wikipedia. Nucleic Acids Res., 2013 (pg. D258–D261).

4. Wahlstedt, Daniel, Enstero, Ohman. Large-scale mRNA sequencing determines global regulation of RNA editing during brain development. Genome Res., 2009 (pg. 978–986).

5. Barbon, Barlati. Genomic organization, proposed alternative splicing mechanisms, and RNA editing structure of GRIK1. Cytogenet. Cell Genet., 2000 (pg. 236–239).

6. Burns, Chu, Rueter, Hutchinson, Canton, Sanders-Bush, Emeson. Regulation of serotonin-2C receptor G-protein coupling by RNA editing. Nature, 1997, vol. 387 (pg. 303–308).

7. Sommer, Kohler, Sprengel, Seeburg. RNA editing in brain controls a determinant of ion flow in glutamate-gated channels. Cell, 1991.

8. Bhalla, Rosenthal, Holmgren, Reenan. Control of human potassium channel inactivation by editing of a small mRNA hairpin. Nat. Struct. Mol. Biol., 2004 (pg. 950–956).

9. Clutterbuck, Leroy, O'Connell, Semple. A bioinformatic screen for novel A-I RNA editing sites reveals recoding editing in BC10. Bioinformatics, 2005 (pg. 2590–2595).

10. Gommans, Tatalias, Sie, Dupuis, Vendetti, Smith, Kaushal, Maas. Screening of human SNP database identifies recoding sites of A-to-I RNA editing. RNA, 2008 (pg. 2074–2085).

11. Levanon, Hallegger, Kinar, Shemesh, Djinovic-Carugo, Rechavi, Jantsch, Eisenberg. Evolutionarily conserved human targets of adenosine to inosine RNA editing. Nucleic Acids Res., 2005 (pg. 1162–1168).

12. Ohlson, Pedersen, Haussler, Ohman. Editing modifies the GABA(A) receptor subunit alpha3. RNA, 2007 (pg. 698–703).

13. Carmi, Borukhov, Levanon. Identification of widespread ultra-edited human RNAs. PLoS Genet., 2011, pg. e1002317.

14. Levanon, Eisenberg, Yelin, Nemzer, Hallegger, Shemesh, Fligelman, Shoshan, Pollock, Sztybel, et al. Systematic identification of abundant A-to-I editing sites in the human transcriptome. Nat. Biotechnol., 2004 (pg. 1001–1005).

15. Sakurai, Yano, Kawabata, Ueda, Suzuki. Inosine cyanoethylation identifies A-to-I RNA editing sites in the human transcriptome. Nat. Chem. Biol., 2010 (pg. 733–740).

16. Levanon, Yoon, Aach, Xie, Leproust, Zhang, Gao, Church. Genome-wide identification of human RNA editing sites by parallel DNA capturing and sequencing. Science, 2009, vol. 324 (pg. 1210–1213).

17. Wang, Bruzel, Richards, Toung, Cheung. Widespread RNA and DNA sequence differences in the human transcriptome. Science, 2011, vol. 333.

18. Kleinman, Majewski. Comment on “Widespread RNA and DNA sequence differences in the human transcriptome”. Science, 2012, vol. 335 (pg. 1302; author reply 1302).

19. Lin, Piskol, Tan. Comment on “Widespread RNA and DNA sequence differences in the human transcriptome”. Science, 2012, vol. 335 (pg. 1302; author reply 1302).

20. Pickrell, Gilad, Pritchard. Comment on “Widespread RNA and DNA sequence differences in the human transcriptome”. Science, 2012, vol. 335 (pg. 1302; author reply 1302).

21. Piskol, Peng, Wang. Lack of evidence for existence of noncanonical RNA editing. Nat. Biotechnol., 2013.

22. Schrider, Gout, Hahn. Very few RNA and DNA sequence differences in the human transcriptome. PLoS One, 2011, pg. e25842.

23. Bahn, Lee, Greer, Peng, Xiao. Accurate identification of A-to-I RNA editing in human by transcriptome sequencing. Genome Res., 2012 (pg. 142–150).

24. Kleinman, Adoue, Majewski. RNA editing of protein sequences: a rare event in human transcriptomes. RNA, 2012 (pg. 1586–1596).

25. Peng, Cheng, Tan, Kang, Tian, Zhu, Zhang, Liang, Tan, et al. Comprehensive analysis of RNA-Seq data reveals extensive RNA editing in a human transcriptome. Nat. Biotechnol., 2012 (pg. 253–260).

26. Ramaswami, Lin, Piskol, Tan, Davis. Accurate identification of human Alu and non-Alu RNA editing sites. Nat. Methods, 2012 (pg. 579–581).

27. Ramaswami, Zhang, Piskol, Keegan, Deng, O'Connell. Identifying RNA editing sites using RNA sequencing data alone. Nat. Methods, 2013 (pg. 128–132).

28. Neeman, Levanon, Jantsch, Eisenberg. RNA editing level in the mouse is determined by the genomic repeat repertoire. RNA, 2006 (pg. 1802–1809).

29. Danecek, Nellaker, McIntyre, Buendia-Buendia, Bumpstead, Ponting, Flint, Durbin, Keane, Adams. High levels of RNA-editing site conservation amongst 15 laboratory mouse strains. Genome Biol., 2012.

30. Graveley, Brooks, Carlson, Duff, Landolin, Yang, Artieri, van Baren, Boley, Booth, et al. The developmental transcriptome of Drosophila melanogaster. Nature, 2011, vol. 471 (pg. 473–479).

31. Rodriguez, Menet, Rosbash. Nascent-seq indicates widespread cotranscriptional RNA editing in Drosophila. Mol. Cell, 2012.

32. Meyer, Zweig, Hinrichs, Karolchik, Kuhn, Wong, Sloan, Rosenbloom, Roe, Rhead, et al. The UCSC Genome Browser database: extensions and updates 2013. Nucleic Acids Res., 2013 (pg. D64).

33. Chen, Lin, Chan, Chow, Song, Liu, Yuan, Kong, et al. Recoding RNA editing of AZIN1 predisposes to hepatocellular carcinoma. Nat. Med., 2013 (pg. 209–216).

34. Rice, Kasher, Forte, Mannion, Greenwood, Szynkiewicz, Dickerson, Bhaskar, Zampini, Briggs, et al. Mutations in ADAR1 cause Aicardi-Goutieres syndrome associated with a type I interferon signature. Nat. Genet., 2012 (pg. 1243–1248).

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
