Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 1987 Oct;1(2):113-25.
doi: 10.1016/0888-7543(87)90003-6.

Origin of the human L1 elements: proposed progenitor genes deduced from a consensus DNA sequence

Affiliations

Origin of the human L1 elements: proposed progenitor genes deduced from a consensus DNA sequence

A F Scott et al. Genomics. 1987 Oct.

Abstract

A consensus sequence for the human long interspersed repeated DNA element, L1Hs (LINE or KpnI sequence), is presented. The sequence contains two open reading frames (ORFs) which are homologous to ORFs in corresponding regions of L1 elements in other species. The L1Hs ORFs are separated by a small evolutionarily nonconserved region. The 5' end of the consensus contains frequent terminators in all three reading frames and has a relatively high GC content with numerous stretches of weak homology with AluI repeats. The 5' ORF extends for a minimum of 723 bp (241 codons). The 3' ORF is 3843 bp (1281 codons) and predicts a protein of 149 kD which has regions of weak homology to the polymerase domain of various reverse transcriptases. The 3' end of the consensus has a 208-bp nonconserved region followed by an adenine-rich end. The organization of the L1Hs consensus sequence resembles the structure of eukaryotic mRNAs except for the noncoding region between ORFs. However, due to base substitutions or truncation most elements appear incapable of producing mRNA that can be translated. Our observation that individual elements cluster into subfamilies on the basis of the presence or absence of blocks of sequence, or by the linkage of alternative bases at multiple positions, suggests that most L1 sequences were derived from a small number of structural genes. An estimate of the mammalian L1 substitution rate was derived and used to predict the age of individual human elements. From this it follows that the majority of human L1 sequences have been generated within the last 30 million years. The human elements studied here differ from each other, yet overall the L1Hs sequences demonstrate a pattern of species-specificity when compared to the L1 families of other mammals. Possible mechanisms that may account for the origin and evolution of the L1 family are discussed. These include pseudogene formation (retroposition), transposition, gene conversion, and RNA recombination.

PubMed Disclaimer

Similar articles

Cited by

References

    1. Adams J.W., Kaufman R.E., Kretschmer P.J., Harrison M., Nienhuis A.W. A family of long reiterated DNA sequences, one copy of which is next to the human beta globin gene. Nucleic Acids Res. 1980;8:6113–6128. - PMC - PubMed
    1. Barnes W.M., Bevan M. Kilo-sequencing: An ordered strategy for rapid DNA sequence data acquisition. Nucleic Acids Res. 1983;11:349–368. - PMC - PubMed
    1. Bernstein L.B., Mount S.M., Weiner A.M. Pseudogenes for human small nuclear RNA U3 appear to arise by integration of self-primed reverse transcripts of the RNA into new chromosomal sites. Cell. 1983;32:461–472. - PubMed
    1. Burton F.H., Loeb D.D., Voliva C.F., Martin S.L., Edgell M.H., Hutchison C.A. Conservation throughout mammalia and extensive protein-coding capacity of the highly repeated DNA long interspersed sequence one. J. Mol. Biol. 1986;187:291–304. - PubMed
    1. Citron B.A., Chaudary P.V., Rao D.N., Kaufman S. Evidence for transcription and potential translation of the human 1.9 kb HindIII repetitive element. Nucleic Acids Res. 1986;14:3137–3142. - PMC - PubMed

Publication types

  NODES
coding 2
HOME 2
Javascript 1
os 23
text 12
twitter 2
web 4