GenBank-Updates@genbank.bio.net (05/17/91)
LOCUS HUMPGEP11B 5351 bp ds-DNA PRI 17-MAY-1991
DEFINITION Human snRNP E protein pseudogene 110.
ACCESSION M65235 M25912
KEYWORDS E protein; pseudogene; small nuclear ribonucleoprotein; snRNP.
SOURCE Human DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 5351)
AUTHORS Stanford,D.R., Holicky,E.L., Perry,C.A., Rehder,K.J., Harvey,S.E.,
Rohleder,A.M. and Wieben,E.D.
TITLE The snRNP E protein multigene family contains five pseudogenes with
common mutations
JOURNAL DNA (1991) In press
STANDARD full staff_entry
FEATURES Location/Qualifiers
repeat_unit 604..1087
/rpt_family="KpnI"
repeat_unit 1607..2305
/rpt_family="KpnI"
repeat_unit 2297..2313
/rpt_type=direct
CDS 2313..2775
/pseudo
/codon_start=2313
promoter 2328..2334
/note="translation initiation sequence"
polyA_signal 2721..2726
polyA_site 2747..2747
repeat_unit 2771..2787
/rpt_type=direct
misc_feature 2783..2846
/note="KpnI-like region"
repeat_unit 2835..2845
/rpt_type=direct
repeat_unit 2849..3157
/rpt_family="Alu"
repeat_unit 3153..3163
/rpt_type=direct
repeat_unit 3166..3900
/rpt_family="KpnI"
repeat_unit 3938..3950
/rpt_type=direct
repeat_unit 3950..4249
/rpt_family="Alu"
repeat_unit 4250..4260
/rpt_type=direct
BASE COUNT 1826 a 1025 c 1050 g 1450 t
ORIGIN
1 ccatggccgt ggtgacaggc agccacccac agccaccgca gggacgggag gggagcagtt
61 atgggctggt tacaacttat ggtcacagaa aataaaacgg ggagacgtcc agctctgggt
121 ggaaaagccc catcttttta agcctggaaa ctatgccatg tctccacgtg cgtattaaag
181 atttcccaat tcggaagtca ctgagcacca gtccttctaa tagagcctca gatagatagt
241 ctgtttcttg aagatcaagt ttgaaacttc actctctgga gaaaagaaca ggatatacat
301 gaaatccgca gaaacacaaa gaaccatcct tcaatgtaaa cttaaaaatt acaaaataga
361 agtaaaatag gaagtaactt ttccctccct tcactcaacc accattcacc tgaaatcctt
421 gccgaaacac ttcaaagtgt gtgggggggc tggaggcaag ttgcccggat gcgggaccct
481 catgcactat gaagggagtg caaattgcta cagtcccttc cagaagcgac aagacaatat
541 atatattctc taaatattct agccatttca cccccaaatc cctctgctgg gaacctgccc
601 taaggaagca attcaaagat caaatcgtat gacaaaacat gttcactgtg gtattcccta
661 taaattaaaa gtcaaatgct acagccataa aaaggacgag gtcatgtcct ttgcagggac
721 atgggtgaag ttggaagcca tcatccttag caaactaaca taggaacaga aaaccaaaca
781 ctgcatgttc tcactcataa gtggaagttg aacaatgaga acacatggac acagggcagg
841 ggaacaacac acaccagggg gtgtggggag tggggggaga gagagcatca ggataaatag
901 ctaatgcatg cggggcttaa aacctagatc gatgggttga tatgcgcagc aaaccgccat
961 ggcacatgtt tacctatgta acaaacctgc acattctgca cttgtatccc ggaacttaaa
1021 gtaaaaaaga aataaataaa gtcaaacgca acttaagtat ttaacatctg ggcatcataa
1081 gtaaattatc ttgcatcaaa tcaatataat attatgtagt atttaaaatt gtaattatgc
1141 taagtatagt aacaaagaaa ataacttaca gaaaatatct gtatattgga caaggtcaag
1201 aaggcactcc ccaaaatgaa agtcatttat ttacctttgt atctaacctg tgccttcagc
1261 tttattccag aaattctcag ggaacccttg tacacatgct ttacctagga tgctggaaat
1321 ggggtttatt cctttaaaaa catttttgac aatttattct gagaaaatta tcatgcactc
1381 attgctgtac tgctactatt ttctaaatgg gaatagcata aatgtccaat aataagggat
1441 ttaaaaaata atagataccc aaacaatgga atattgtgct accaatcaaa cttatgttgt
1501 aggagaataa tgacatcaga aaatgtttct gatatattaa atgaagtaaa ttacaaaatc
1561 gatttaaact aggttcgctt ttaaaaggta tatattttta attttaattc cttaatattt
1621 ttcttttata tataataaat gttaattcaa aatggaaaaa gtgttccaga gctgagcctg
1681 ataatttaag aataaaaata tccaagggtt aagagctaaa actataaatt ttgaagaaac
1741 atagttgtaa atctttataa ccttggatta ggaaacaaca gacatgtgac aactaaagaa
1801 aaaatattag ataaactgga aagagaaaag acatggaaga aaattttttc aaatcatatt
1861 tctgataagg gcctaatatc ccaaatctat aaagaactct taaaattcaa caataaaaag
1921 acaacaaatc caattcaaaa atgggcaaag gatttgaata gacatgatcc caagctgtac
1981 aaacagtcaa caagtgcatg aataaatgcc caccgtcatt agttattaaa ggaatataaa
2041 gcaaaaacca caaggagata ccccttcaca cccgttagga ccacttaaag aaagaaagat
2101 atggtctatc ttgtaattgt ctgacaagtt tagttaagga agatgtggag aaactagatc
2161 cctcctgcat ttctagtgag aaccggaaaa tgatgcagcc acttaccaaa ccagtttagc
2221 agttcctcaa aagttaaaca taatgttacc atttgactcg agcaattcca ctgctaggta
2281 tgtacccaag agaactgaaa acatatgtcc tcctcttgtg aaattccacc atggtgtacc
2341 atggccaggg ccagaaagtg caaagggtta tggtgcagcc catcaacctc atcttcagat
2401 acttacaaaa tagatcacgg attcaagtgt agctctatga gcaagtgaat atgaggatag
2461 aaggctgtat cattggtttt gatgagcata tgaaccttgt attagattat gcgaagagat
2521 tcattctaaa acaaagtcaa gaaaacaact gggtcggctc atgctaaaag gagataatat
2581 tcctctgctg caaagtgtct ccaactagaa atgatcaatg aagtgagaaa ttgttgagaa
2641 ggatacagtt tgtttttaga tgtccaatat gaacatttat tcatattgtt ttgattaccc
2701 ttatgttatt acaagatggc aataaatgct gtgggattgt ttgtactaaa aaaaaaaaag
2761 aaagaaaaaa gaaaacatat gtcctcccaa aatttggtac aagaatgttt atgacagcat
2821 gatttataac agtcaaaaag ggaaggaggg ctgggcgttg gtggctacac gtgtaatcct
2881 agcattttgg aaggccaagg tggtggatca cctgaggtca ggagtttaga ggaccagccc
2941 ggccaacatg atgaacatgc ccccacctct actaaaaaca caaaaattag ccgggcttgg
3001 tggtgggcac cttgtaatcc aagatgcttg ggaggctgag gcaggagaat cgcttgaacc
3061 tgggaggcag aggttgcagt gagccaagat ggcgcccatg cactccagcc tggggaccag
3121 agtgagactc tgtctaaaaa ttaataaaaa taaaaaagag aagcaaccca aaagtccact
3181 aactgatgag ttaataaata aaacatgata tatccatatc tataccatgg aatattattt
3241 gccaataaaa agaaagaata attaacgcat gctacaacat aaacaaatca tggaacatta
3301 tgctcagtgc aagaagccag acactaaagg tgtgcattat actctttcat ttgtaggaaa
3361 tacggcaaat ccgtaaagac agaaagtaga tgatcagtgg ttgccagagg ctgggggaag
3421 agggaatggg aagtgaatgc taatgggttg ccgggtttct ttttgaggca gcaaaaatgt
3481 ccttgaatta gatagtgctg atgactgcac acctttgtga atatactaaa aaccactggg
3541 ctgtactttt taaagaggtg aattttatga catatgtaac tatatcccaa ttttaattat
3601 gcaaatatat acaataatgc accacataac agtacttcag tcaatgtcag accacatgta
3661 ccatggccat ccaatgagat tataatgaag ctgaaaagtt cctgtctcct ggtggtgatg
3721 ctatagctgt catcatggca ttaacccaac acattactca cgtgtttaat gtaaacaaac
3781 ctactgcgct accagttgta taaaaatcta gcacatagaa ttatgaatag tacataatac
3841 ttgataatcg ataataaatg actattatca taatatagta tgttgtacta atttatgtac
3901 aattcaatat atgtttatac gtattttaga gtatacttcc tttccaacct tttttttttt
3961 tttgagacgg agcctcgctc tgtcacccaa gactggaggg cagtggtgca atctcggctc
4021 cctgcaacct ctggcctccc aggttgcaag cgattctcct gcctcagcct cccaagcagc
4081 tgggatttta ggcacccgcc accaggccca gctaattttt gtgtttttac tagagaaagg
4141 gtttcaccag gttggccaag ttggtctcaa actcctgacc tcaggtgatc cacccgcctc
4201 gggcctccca aagtgctggg attacaggcc tgagccaccg caccctgcct ccttctactt
4261 attaaaaagc aaaagttaac tataaaacag ccccaggcag gtccatcagg agatatttca
4321 gaggaaggca ttgctatcat accagataga cgtccatgtg tgtttctgcc cctgaagacc
4381 ttccagtggg acaagatgtg gagatgcaag acagtgatat taatgatcct gaccctgggt
4441 gggcctagga taatgtgtgt gtgttttagc ttttaacaaa aaagtcaaat ttttaaaaat
4501 ttttaaatag aaaaaaaacg tatagaataa ggatataaaa tatttttgta gagctgtaca
4561 ttgtgtctgt gtttaagctg ttattacaaa agtcaaagtt gaaaatagaa gtttattagt
4621 aaaatgttac atagctaagg taatttattg aagcagaaat tttttttaat taaatttgaa
4681 accatagctg aatgtgacag tgttacaaag tctacaggaa tgtacactaa tatctaagcc
4741 ttcacactca ctcaccactc actcactcac tcacccaaag caactccagt cctgcaagct
4801 ccattcacgg taagcgtcct agatgggtgt actaggtttt catcttttat gccatattct
4861 tactggacct tttctatatt tatataaaca cttattcata ccattgtgtg acaattgcct
4921 aaagtattca gcaaagtgac atgctgttcg ggtttgaagc caaggagcaa taggctacac
4981 cacatagccc aggtgtgtag cggctctgcc aaagctctct atgactgctc tgggacggcc
5041 tcaccatggc atcacctaat gatggatttc tgaccacgga tccccgtcgt aagtgatgct
5101 aactatactt acgagtacat tgtgatgagt cattatttac aggtcatttt aaacacccat
5161 acatgattta caggactcta tctcctagaa ctttaatttc ctacagacag tcacaagaaa
5221 aacattaaga tgaacttaga gaacaagcct gagtgctttc tccttgcatt tcataacaaa
5281 aggcaatgtt cataggggga tttcagatag tgtgttgatt tagggagaaa aatcacagtg
5341 ctaaagaatt c
//GenBank-Updates@genbank.bio.net (05/22/91)
LOCUS HUMPGEP102 5351 bp ds-DNA PRI 22-MAY-1991
DEFINITION Human snRNP E protein pseudogene 110.
ACCESSION M65235 M25912
KEYWORDS E protein; pseudogene; small nuclear ribonucleoprotein; snRNP.
SEGMENT 2 of 2
SOURCE Human fetal liver DNA, clone 110.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 5351)
AUTHORS Stanford,D.R., Holicky,E.L., Perry,C.A., Rehder,K.J., Harvey,S.E.,
Rohleder,A.M. and Wieben,E.D.
TITLE The snRNP E protein multigene family contains five pseudogenes with
common mutations
JOURNAL DNA Sequence (1991) In press
STANDARD full staff_entry
FEATURES Location/Qualifiers
repeat_unit 604..1087
/rpt_family="KpnI"
repeat_unit 1607..2305
/rpt_family="KpnI"
repeat_unit 2297..2313
/rpt_type=direct
mRNA 2313..2775
/pseudo=gene
CDS 2331..2432
/pseudo=gene
/codon_start=2331
polyA_signal 2721..2726
polyA_site 2747..2747
repeat_unit 2771..2787
/rpt_type=direct
misc_feature 2783..2846
/note="KpnI-like region"
repeat_unit 2835..2845
/rpt_type=direct
repeat_unit 2849..3157
/rpt_family="Alu"
repeat_unit 3153..3163
/rpt_type=direct
repeat_unit 3166..3900
/rpt_family="KpnI"
repeat_unit 3938..3950
/rpt_type=direct
repeat_unit 3950..4249
/rpt_family="Alu"
repeat_unit 4250..4260
/rpt_type=direct
BASE COUNT 1826 a 1025 c 1050 g 1450 t
ORIGIN About 900 bp after segment 1.
1 ccatggccgt ggtgacaggc agccacccac agccaccgca gggacgggag gggagcagtt
61 atgggctggt tacaacttat ggtcacagaa aataaaacgg ggagacgtcc agctctgggt
121 ggaaaagccc catcttttta agcctggaaa ctatgccatg tctccacgtg cgtattaaag
181 atttcccaat tcggaagtca ctgagcacca gtccttctaa tagagcctca gatagatagt
241 ctgtttcttg aagatcaagt ttgaaacttc actctctgga gaaaagaaca ggatatacat
301 gaaatccgca gaaacacaaa gaaccatcct tcaatgtaaa cttaaaaatt acaaaataga
361 agtaaaatag gaagtaactt ttccctccct tcactcaacc accattcacc tgaaatcctt
421 gccgaaacac ttcaaagtgt gtgggggggc tggaggcaag ttgcccggat gcgggaccct
481 catgcactat gaagggagtg caaattgcta cagtcccttc cagaagcgac aagacaatat
541 atatattctc taaatattct agccatttca cccccaaatc cctctgctgg gaacctgccc
601 taaggaagca attcaaagat caaatcgtat gacaaaacat gttcactgtg gtattcccta
661 taaattaaaa gtcaaatgct acagccataa aaaggacgag gtcatgtcct ttgcagggac
721 atgggtgaag ttggaagcca tcatccttag caaactaaca taggaacaga aaaccaaaca
781 ctgcatgttc tcactcataa gtggaagttg aacaatgaga acacatggac acagggcagg
841 ggaacaacac acaccagggg gtgtggggag tggggggaga gagagcatca ggataaatag
901 ctaatgcatg cggggcttaa aacctagatc gatgggttga tatgcgcagc aaaccgccat
961 ggcacatgtt tacctatgta acaaacctgc acattctgca cttgtatccc ggaacttaaa
1021 gtaaaaaaga aataaataaa gtcaaacgca acttaagtat ttaacatctg ggcatcataa
1081 gtaaattatc ttgcatcaaa tcaatataat attatgtagt atttaaaatt gtaattatgc
1141 taagtatagt aacaaagaaa ataacttaca gaaaatatct gtatattgga caaggtcaag
1201 aaggcactcc ccaaaatgaa agtcatttat ttacctttgt atctaacctg tgccttcagc
1261 tttattccag aaattctcag ggaacccttg tacacatgct ttacctagga tgctggaaat
1321 ggggtttatt cctttaaaaa catttttgac aatttattct gagaaaatta tcatgcactc
1381 attgctgtac tgctactatt ttctaaatgg gaatagcata aatgtccaat aataagggat
1441 ttaaaaaata atagataccc aaacaatgga atattgtgct accaatcaaa cttatgttgt
1501 aggagaataa tgacatcaga aaatgtttct gatatattaa atgaagtaaa ttacaaaatc
1561 gatttaaact aggttcgctt ttaaaaggta tatattttta attttaattc cttaatattt
1621 ttcttttata tataataaat gttaattcaa aatggaaaaa gtgttccaga gctgagcctg
1681 ataatttaag aataaaaata tccaagggtt aagagctaaa actataaatt ttgaagaaac
1741 atagttgtaa atctttataa ccttggatta ggaaacaaca gacatgtgac aactaaagaa
1801 aaaatattag ataaactgga aagagaaaag acatggaaga aaattttttc aaatcatatt
1861 tctgataagg gcctaatatc ccaaatctat aaagaactct taaaattcaa caataaaaag
1921 acaacaaatc caattcaaaa atgggcaaag gatttgaata gacatgatcc caagctgtac
1981 aaacagtcaa caagtgcatg aataaatgcc caccgtcatt agttattaaa ggaatataaa
2041 gcaaaaacca caaggagata ccccttcaca cccgttagga ccacttaaag aaagaaagat
2101 atggtctatc ttgtaattgt ctgacaagtt tagttaagga agatgtggag aaactagatc
2161 cctcctgcat ttctagtgag aaccggaaaa tgatgcagcc acttaccaaa ccagtttagc
2221 agttcctcaa aagttaaaca taatgttacc atttgactcg agcaattcca ctgctaggta
2281 tgtacccaag agaactgaaa acatatgtcc tcctcttgtg aaattccacc atggtgtacc
2341 atggccaggg ccagaaagtg caaagggtta tggtgcagcc catcaacctc atcttcagat
2401 acttacaaaa tagatcacgg attcaagtgt agctctatga gcaagtgaat atgaggatag
2461 aaggctgtat cattggtttt gatgagcata tgaaccttgt attagattat gcgaagagat
2521 tcattctaaa acaaagtcaa gaaaacaact gggtcggctc atgctaaaag gagataatat
2581 tcctctgctg caaagtgtct ccaactagaa atgatcaatg aagtgagaaa ttgttgagaa
2641 ggatacagtt tgtttttaga tgtccaatat gaacatttat tcatattgtt ttgattaccc
2701 ttatgttatt acaagatggc aataaatgct gtgggattgt ttgtactaaa aaaaaaaaag
2761 aaagaaaaaa gaaaacatat gtcctcccaa aatttggtac aagaatgttt atgacagcat
2821 gatttataac agtcaaaaag ggaaggaggg ctgggcgttg gtggctacac gtgtaatcct
2881 agcattttgg aaggccaagg tggtggatca cctgaggtca ggagtttaga ggaccagccc
2941 ggccaacatg atgaacatgc ccccacctct actaaaaaca caaaaattag ccgggcttgg
3001 tggtgggcac cttgtaatcc aagatgcttg ggaggctgag gcaggagaat cgcttgaacc
3061 tgggaggcag aggttgcagt gagccaagat ggcgcccatg cactccagcc tggggaccag
3121 agtgagactc tgtctaaaaa ttaataaaaa taaaaaagag aagcaaccca aaagtccact
3181 aactgatgag ttaataaata aaacatgata tatccatatc tataccatgg aatattattt
3241 gccaataaaa agaaagaata attaacgcat gctacaacat aaacaaatca tggaacatta
3301 tgctcagtgc aagaagccag acactaaagg tgtgcattat actctttcat ttgtaggaaa
3361 tacggcaaat ccgtaaagac agaaagtaga tgatcagtgg ttgccagagg ctgggggaag
3421 agggaatggg aagtgaatgc taatgggttg ccgggtttct ttttgaggca gcaaaaatgt
3481 ccttgaatta gatagtgctg atgactgcac acctttgtga atatactaaa aaccactggg
3541 ctgtactttt taaagaggtg aattttatga catatgtaac tatatcccaa ttttaattat
3601 gcaaatatat acaataatgc accacataac agtacttcag tcaatgtcag accacatgta
3661 ccatggccat ccaatgagat tataatgaag ctgaaaagtt cctgtctcct ggtggtgatg
3721 ctatagctgt catcatggca ttaacccaac acattactca cgtgtttaat gtaaacaaac
3781 ctactgcgct accagttgta taaaaatcta gcacatagaa ttatgaatag tacataatac
3841 ttgataatcg ataataaatg actattatca taatatagta tgttgtacta atttatgtac
3901 aattcaatat atgtttatac gtattttaga gtatacttcc tttccaacct tttttttttt
3961 tttgagacgg agcctcgctc tgtcacccaa gactggaggg cagtggtgca atctcggctc
4021 cctgcaacct ctggcctccc aggttgcaag cgattctcct gcctcagcct cccaagcagc
4081 tgggatttta ggcacccgcc accaggccca gctaattttt gtgtttttac tagagaaagg
4141 gtttcaccag gttggccaag ttggtctcaa actcctgacc tcaggtgatc cacccgcctc
4201 gggcctccca aagtgctggg attacaggcc tgagccaccg caccctgcct ccttctactt
4261 attaaaaagc aaaagttaac tataaaacag ccccaggcag gtccatcagg agatatttca
4321 gaggaaggca ttgctatcat accagataga cgtccatgtg tgtttctgcc cctgaagacc
4381 ttccagtggg acaagatgtg gagatgcaag acagtgatat taatgatcct gaccctgggt
4441 gggcctagga taatgtgtgt gtgttttagc ttttaacaaa aaagtcaaat ttttaaaaat
4501 ttttaaatag aaaaaaaacg tatagaataa ggatataaaa tatttttgta gagctgtaca
4561 ttgtgtctgt gtttaagctg ttattacaaa agtcaaagtt gaaaatagaa gtttattagt
4621 aaaatgttac atagctaagg taatttattg aagcagaaat tttttttaat taaatttgaa
4681 accatagctg aatgtgacag tgttacaaag tctacaggaa tgtacactaa tatctaagcc
4741 ttcacactca ctcaccactc actcactcac tcacccaaag caactccagt cctgcaagct
4801 ccattcacgg taagcgtcct agatgggtgt actaggtttt catcttttat gccatattct
4861 tactggacct tttctatatt tatataaaca cttattcata ccattgtgtg acaattgcct
4921 aaagtattca gcaaagtgac atgctgttcg ggtttgaagc caaggagcaa taggctacac
4981 cacatagccc aggtgtgtag cggctctgcc aaagctctct atgactgctc tgggacggcc
5041 tcaccatggc atcacctaat gatggatttc tgaccacgga tccccgtcgt aagtgatgct
5101 aactatactt acgagtacat tgtgatgagt cattatttac aggtcatttt aaacacccat
5161 acatgattta caggactcta tctcctagaa ctttaatttc ctacagacag tcacaagaaa
5221 aacattaaga tgaacttaga gaacaagcct gagtgctttc tccttgcatt tcataacaaa
5281 aggcaatgtt cataggggga tttcagatag tgtgttgatt tagggagaaa aatcacagtg
5341 ctaaagaatt c
//