GenBank-Updates@genbank.bio.net (05/17/91)
LOCUS HUMPGEPEB 2863 bp ds-DNA PRI 17-MAY-1991
DEFINITION Human snRNP E protein pseudogene EB.
ACCESSION M65126 M25911
KEYWORDS E protein; pseudogene; small nuclear ribonucleoprotein; snRNP.
SOURCE Human DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2863)
AUTHORS Stanford,D.R., Holicky,E.L., Perry,C.A., Rehder,K.J., Harvey,S.E.,
Rohleder,A.M. and Wieben,E.D.
TITLE The snRNP E protein multigene family contains five pseudogenes with
common mutations
JOURNAL DNA (1991) In press
STANDARD full staff_entry
FEATURES Location/Qualifiers
repeat_unit 999..1012
/rpt_type=direct
repeat_unit 1013..1330
/rpt_family="Alu"
repeat_unit 1328..1341
/rpt_type=direct
repeat_unit 1467..1480
/rpt_type=direct
CDS 1480..1954
/pseudo
/codon_start=1480
promoter 1511..1517
/note="translation initiation sequence"
polyA_signal 1914..1919
polyA_site 1941..1941
repeat_unit 1951..1964
/rpt_type=direct
BASE COUNT 862 a 621 c 600 g 780 t
ORIGIN
1 gagctcattg taagaaacag ttactgccta gtacagcagt ctgttaacta gtctctccat
61 ttttgattgc tccatgcccc aattgttaat gggcattgct aacaccagct tccctacagt
121 gcagttttga gcagtccata aaccagactc cagccccctc agcagtcatt ggtctccttt
181 cctgcttgcc caatcttatc gcctgttgct ccctgtcaaa atgaatattt ctggttcctc
241 agatattttt gctaatacta ttgcttttgc tgggaaactg ttaccacatc ctaaatttcg
301 tatggaaaaa ctttacctaa ctgttaaatt cccagctgag caccgtctcc ttcattaagc
361 cattagtgac tattgcctgt tccaggcgaa tttgtcactc ttttgaacta cagtattttc
421 cacatgccat gggtattatg gttgttgata tttatgtcct atttactaaa caaaagtata
481 cacatatttg aggcaatata agcaagcaaa ctggcagccc acactgtatt ttcagaattg
541 ggattttttt ttaacataaa aatcctgatt ttcatcttct cttgaaaggt cagaagatct
601 agcttacatc ctcaaagaca ggtcagcaag ccacagcctg tggcaaatct gacctggact
661 ctgtttttag aaataaagtt tgattggaac acagccccac ccatttgttt atatattcac
721 agtggatgct ttccttttac tgtggcagag gtgagtagtt gagacagaga ctctatggcc
781 tagaaagcct atttgctatc cagcccttta caaaaaaagt ttgaccaact cctgctacaa
841 ggcaccaacc tgtggagtga tgcacgtact ccaggacctc ttcctactcc atatcagacc
901 tggccagttg caagtattta tcctgtcttc caaggccttt gaaatactag agtttttcat
961 tcttcttaat cctcttgatc atatcctaac acaatatgaa agccattatg ccgggtggct
1021 ctggctagga caagcctgta atccaaacac tttaggaggc tgaggcaggc ggatcacctg
1081 aggtcaggag tttgagacca gcctggccaa tatggtgaaa ccctgtctct actaaaaata
1141 caaaaattag ccaggcatgg tggcacgcat ctgtaaccct agctacttgg gtggctgagg
1201 caggacaatc gcttgaacct gggaggtgaa gtttgcagtg agcaaagatt gtaccactgc
1261 attccaactg cattccagcc tgggcgacag agctagactc tgtctccaaa aaaaaaaaaa
1321 aaaaaaaaaa gccattatgc cataaaggca aggcaatctc taaaataaat attcaaaacg
1381 attacactgc caatataaat taaatgatca actctaatca tagtcctcag taactatagc
1441 tactgataaa gtaggttctt agagttaaaa gcttgaaatc agcgggtggg tgtgctcttt
1501 gtgaaattcc accatggcat accatggcca gggccagaaa gtgcagaagg ttatggtgca
1561 gcccatcaat ctcatcttca gataaactta caaaatagat cgcagattca ggtgtggctc
1621 tatgagcaag tgaatatgcg gatagaaggc tgtatcatta gttttgatga gtatatgaac
1681 cttgtattag atgatacaga agagattcat tctaaaacaa agtcatgaat acgaccgagt
1741 cggatcatgc taaaaggaca taatattact ctgctacgaa gtttctccaa ctagaaatga
1801 acaatgaagt gagaaattgt tgagatggat acagtttgtt tttagatgtt ttttgtccaa
1861 tatgaacatt tattcgtatt gttttgatta cccttatgtt attacaagat ggcaataaat
1921 gctatgggat tgtttgtatt taaaaaaaaa aaaagctcaa agtctaaagt cagattgcct
1981 ggactcatac tccagctgtg ccacctatgt gagctgggtg agtctttgtt tccttgtctg
2041 aatataatga ggataacagc agtgcctacc ttggattgat gaaaggagtc cattgcaact
2101 aatatttagt gatagctact ttgtgtcagg cactgtgcta ggctcttggg agcatcagtg
2161 agcaaaacag ccacatttcc acttagaaga gcttaactta ggaagatgtt gggggaaatg
2221 gagcaggcag cagacaccac tgatacctgt tgcacagcca cttgggccac cactgagttc
2281 agccacagcc ataggggaaa gcttcgactc ttctctggct ccaaagcctt ctccaatgca
2341 gcgatgtagc tggaaccagt ttggctctca ggcaagcaca acctggaagg gaggaattaa
2401 tgcccagagg cagcccccaa caaggtgggc caggagttgg cgatagagga tatggccccc
2461 gtccttagaa gaacaactct gggaggcatt ctgtatgacc ccttgtagag gtcctggagg
2521 ataaagctcc agagtaagga gagacgctag taacacccta catgacttga ctttcctgct
2581 accttccctc tttctggcaa cattatccaa ataaactatc tgtatccaaa cctgtttcat
2641 gtgcggcctt tggaggaaat ccaaagaaga acaaattgta aacaaacaaa aacaaaaaaa
2701 caatacaaaa aagtagtaag tgccatgcag agaatgaaaa caggtgtgaa caaagtgagt
2761 gatcagacac cccctgacaa tggggccatg gaatgcctcc attgagagct gagtgacagg
2821 aggaatccag ccatgccccg gctagaggat gaatactctg cag
//
GenBank-Updates@genbank.bio.net (05/22/91)
LOCUS HUMPGEPEB 2863 bp ds-DNA PRI 22-MAY-1991
DEFINITION Human snRNP E protein pseudogene EB.
ACCESSION M65126 M25911
KEYWORDS E protein; pseudogene; small nuclear ribonucleoprotein; snRNP.
SOURCE Human fetal liver DNA, clone EB and peripheral blood leukocyte DNA,
clone LH66.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2863)
AUTHORS Stanford,D.R., Holicky,E.L., Perry,C.A., Rehder,K.J., Harvey,S.E.,
Rohleder,A.M. and Wieben,E.D.
TITLE The snRNP E protein multigene family contains five pseudogenes with
common mutations
JOURNAL DNA Sequence (1991) In press
STANDARD full staff_entry
FEATURES Location/Qualifiers
repeat_unit 999..1012
/rpt_type=direct
repeat_unit 1013..1330
/rpt_family="Alu"
repeat_unit 1328..1341
/rpt_type=direct
repeat_unit 1467..1480
/rpt_type=direct
mRNA 1480..1954
/pseudo=gene
CDS 1514..1585
/pseudo=gene
/codon_start=1514
polyA_signal 1914..1919
polyA_site 1941..1941
repeat_unit 1951..1964
/rpt_type=direct
allele 2780..2780
/note="replace (2780, 'c'), in clone LH66"
BASE COUNT 862 a 621 c 600 g 780 t
ORIGIN
1 gagctcattg taagaaacag ttactgccta gtacagcagt ctgttaacta gtctctccat
61 ttttgattgc tccatgcccc aattgttaat gggcattgct aacaccagct tccctacagt
121 gcagttttga gcagtccata aaccagactc cagccccctc agcagtcatt ggtctccttt
181 cctgcttgcc caatcttatc gcctgttgct ccctgtcaaa atgaatattt ctggttcctc
241 agatattttt gctaatacta ttgcttttgc tgggaaactg ttaccacatc ctaaatttcg
301 tatggaaaaa ctttacctaa ctgttaaatt cccagctgag caccgtctcc ttcattaagc
361 cattagtgac tattgcctgt tccaggcgaa tttgtcactc ttttgaacta cagtattttc
421 cacatgccat gggtattatg gttgttgata tttatgtcct atttactaaa caaaagtata
481 cacatatttg aggcaatata agcaagcaaa ctggcagccc acactgtatt ttcagaattg
541 ggattttttt ttaacataaa aatcctgatt ttcatcttct cttgaaaggt cagaagatct
601 agcttacatc ctcaaagaca ggtcagcaag ccacagcctg tggcaaatct gacctggact
661 ctgtttttag aaataaagtt tgattggaac acagccccac ccatttgttt atatattcac
721 agtggatgct ttccttttac tgtggcagag gtgagtagtt gagacagaga ctctatggcc
781 tagaaagcct atttgctatc cagcccttta caaaaaaagt ttgaccaact cctgctacaa
841 ggcaccaacc tgtggagtga tgcacgtact ccaggacctc ttcctactcc atatcagacc
901 tggccagttg caagtattta tcctgtcttc caaggccttt gaaatactag agtttttcat
961 tcttcttaat cctcttgatc atatcctaac acaatatgaa agccattatg ccgggtggct
1021 ctggctagga caagcctgta atccaaacac tttaggaggc tgaggcaggc ggatcacctg
1081 aggtcaggag tttgagacca gcctggccaa tatggtgaaa ccctgtctct actaaaaata
1141 caaaaattag ccaggcatgg tggcacgcat ctgtaaccct agctacttgg gtggctgagg
1201 caggacaatc gcttgaacct gggaggtgaa gtttgcagtg agcaaagatt gtaccactgc
1261 attccaactg cattccagcc tgggcgacag agctagactc tgtctccaaa aaaaaaaaaa
1321 aaaaaaaaaa gccattatgc cataaaggca aggcaatctc taaaataaat attcaaaacg
1381 attacactgc caatataaat taaatgatca actctaatca tagtcctcag taactatagc
1441 tactgataaa gtaggttctt agagttaaaa gcttgaaatc agcgggtggg tgtgctcttt
1501 gtgaaattcc accatggcat accatggcca gggccagaaa gtgcagaagg ttatggtgca
1561 gcccatcaat ctcatcttca gataaactta caaaatagat cgcagattca ggtgtggctc
1621 tatgagcaag tgaatatgcg gatagaaggc tgtatcatta gttttgatga gtatatgaac
1681 cttgtattag atgatacaga agagattcat tctaaaacaa agtcatgaat acgaccgagt
1741 cggatcatgc taaaaggaca taatattact ctgctacgaa gtttctccaa ctagaaatga
1801 acaatgaagt gagaaattgt tgagatggat acagtttgtt tttagatgtt ttttgtccaa
1861 tatgaacatt tattcgtatt gttttgatta cccttatgtt attacaagat ggcaataaat
1921 gctatgggat tgtttgtatt taaaaaaaaa aaaagctcaa agtctaaagt cagattgcct
1981 ggactcatac tccagctgtg ccacctatgt gagctgggtg agtctttgtt tccttgtctg
2041 aatataatga ggataacagc agtgcctacc ttggattgat gaaaggagtc cattgcaact
2101 aatatttagt gatagctact ttgtgtcagg cactgtgcta ggctcttggg agcatcagtg
2161 agcaaaacag ccacatttcc acttagaaga gcttaactta ggaagatgtt gggggaaatg
2221 gagcaggcag cagacaccac tgatacctgt tgcacagcca cttgggccac cactgagttc
2281 agccacagcc ataggggaaa gcttcgactc ttctctggct ccaaagcctt ctccaatgca
2341 gcgatgtagc tggaaccagt ttggctctca ggcaagcaca acctggaagg gaggaattaa
2401 tgcccagagg cagcccccaa caaggtgggc caggagttgg cgatagagga tatggccccc
2461 gtccttagaa gaacaactct gggaggcatt ctgtatgacc ccttgtagag gtcctggagg
2521 ataaagctcc agagtaagga gagacgctag taacacccta catgacttga ctttcctgct
2581 accttccctc tttctggcaa cattatccaa ataaactatc tgtatccaaa cctgtttcat
2641 gtgcggcctt tggaggaaat ccaaagaaga acaaattgta aacaaacaaa aacaaaaaaa
2701 caatacaaaa aagtagtaag tgccatgcag agaatgaaaa caggtgtgaa caaagtgagt
2761 gatcagacac cccctgacaa tggggccatg gaatgcctcc attgagagct gagtgacagg
2821 aggaatccag ccatgccccg gctagaggat gaatactctg cag
//