GenBank-Updates@genbank.bio.net (05/17/91)
LOCUS       HUMPGEP11B   5351 bp ds-DNA             PRI       17-MAY-1991
DEFINITION  Human snRNP E protein pseudogene 110.
ACCESSION   M65235 M25912
KEYWORDS    E protein; pseudogene; small nuclear ribonucleoprotein; snRNP.
SOURCE      Human DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 5351)
  AUTHORS   Stanford,D.R., Holicky,E.L., Perry,C.A., Rehder,K.J., Harvey,S.E.,
            Rohleder,A.M. and Wieben,E.D.
  TITLE     The snRNP E protein multigene family contains five pseudogenes
            with common mutations
  JOURNAL   DNA (1991) In press
  STANDARD  full staff_entry
FEATURES             Location/Qualifiers
     repeat_unit     604..1087
                     /rpt_family="KpnI"
     repeat_unit     1607..2305
                     /rpt_family="KpnI"
     repeat_unit     2297..2313
                     /rpt_type=direct
     CDS             2313..2775
                     /pseudo
                     /codon_start=2313
     promoter        2328..2334
                     /note="translation initiation sequence"
     polyA_signal    2721..2726
     polyA_site      2747..2747
     repeat_unit     2771..2787
                     /rpt_type=direct
     misc_feature    2783..2846
                     /note="KpnI-like region"
     repeat_unit     2835..2845
                     /rpt_type=direct
     repeat_unit     2849..3157
                     /rpt_family="Alu"
     repeat_unit     3153..3163
                     /rpt_type=direct
     repeat_unit     3166..3900
                     /rpt_family="KpnI"
     repeat_unit     3938..3950
                     /rpt_type=direct
     repeat_unit     3950..4249
                     /rpt_family="Alu"
     repeat_unit     4250..4260
                     /rpt_type=direct
BASE COUNT     1826 a   1025 c   1050 g   1450 t
ORIGIN
1 ccatggccgt ggtgacaggc agccacccac agccaccgca gggacgggag gggagcagtt 61 atgggctggt tacaacttat ggtcacagaa aataaaacgg ggagacgtcc agctctgggt 121 ggaaaagccc catcttttta agcctggaaa ctatgccatg tctccacgtg cgtattaaag 181 atttcccaat tcggaagtca ctgagcacca gtccttctaa tagagcctca gatagatagt 241 ctgtttcttg aagatcaagt ttgaaacttc actctctgga gaaaagaaca ggatatacat 301 gaaatccgca gaaacacaaa gaaccatcct tcaatgtaaa cttaaaaatt acaaaataga 361 agtaaaatag gaagtaactt ttccctccct tcactcaacc accattcacc tgaaatcctt 421 gccgaaacac ttcaaagtgt gtgggggggc tggaggcaag ttgcccggat gcgggaccct 481 catgcactat gaagggagtg caaattgcta cagtcccttc cagaagcgac aagacaatat 541 atatattctc
taaatattct agccatttca cccccaaatc cctctgctgg gaacctgccc 601 taaggaagca attcaaagat caaatcgtat gacaaaacat gttcactgtg gtattcccta 661 taaattaaaa gtcaaatgct acagccataa aaaggacgag gtcatgtcct ttgcagggac 721 atgggtgaag ttggaagcca tcatccttag caaactaaca taggaacaga aaaccaaaca 781 ctgcatgttc tcactcataa gtggaagttg aacaatgaga acacatggac acagggcagg 841 ggaacaacac acaccagggg gtgtggggag tggggggaga gagagcatca ggataaatag 901 ctaatgcatg cggggcttaa aacctagatc gatgggttga tatgcgcagc aaaccgccat 961 ggcacatgtt tacctatgta acaaacctgc acattctgca cttgtatccc ggaacttaaa 1021 gtaaaaaaga aataaataaa gtcaaacgca acttaagtat ttaacatctg ggcatcataa 1081 gtaaattatc ttgcatcaaa tcaatataat attatgtagt atttaaaatt gtaattatgc 1141 taagtatagt aacaaagaaa ataacttaca gaaaatatct gtatattgga caaggtcaag 1201 aaggcactcc ccaaaatgaa agtcatttat ttacctttgt atctaacctg tgccttcagc 1261 tttattccag aaattctcag ggaacccttg tacacatgct ttacctagga tgctggaaat 1321 ggggtttatt cctttaaaaa catttttgac aatttattct gagaaaatta tcatgcactc 1381 attgctgtac tgctactatt ttctaaatgg gaatagcata aatgtccaat aataagggat 1441 ttaaaaaata atagataccc aaacaatgga atattgtgct accaatcaaa cttatgttgt 1501 aggagaataa tgacatcaga aaatgtttct gatatattaa atgaagtaaa ttacaaaatc 1561 gatttaaact aggttcgctt ttaaaaggta tatattttta attttaattc cttaatattt 1621 ttcttttata tataataaat gttaattcaa aatggaaaaa gtgttccaga gctgagcctg 1681 ataatttaag aataaaaata tccaagggtt aagagctaaa actataaatt ttgaagaaac 1741 atagttgtaa atctttataa ccttggatta ggaaacaaca gacatgtgac aactaaagaa 1801 aaaatattag ataaactgga aagagaaaag acatggaaga aaattttttc aaatcatatt 1861 tctgataagg gcctaatatc ccaaatctat aaagaactct taaaattcaa caataaaaag 1921 acaacaaatc caattcaaaa atgggcaaag gatttgaata gacatgatcc caagctgtac 1981 aaacagtcaa caagtgcatg aataaatgcc caccgtcatt agttattaaa ggaatataaa 2041 gcaaaaacca caaggagata ccccttcaca cccgttagga ccacttaaag aaagaaagat 2101 atggtctatc ttgtaattgt ctgacaagtt tagttaagga agatgtggag aaactagatc 2161 cctcctgcat ttctagtgag aaccggaaaa tgatgcagcc acttaccaaa ccagtttagc 2221 agttcctcaa aagttaaaca 
taatgttacc atttgactcg agcaattcca ctgctaggta 2281 tgtacccaag agaactgaaa acatatgtcc tcctcttgtg aaattccacc atggtgtacc 2341 atggccaggg ccagaaagtg caaagggtta tggtgcagcc catcaacctc atcttcagat 2401 acttacaaaa tagatcacgg attcaagtgt agctctatga gcaagtgaat atgaggatag 2461 aaggctgtat cattggtttt gatgagcata tgaaccttgt attagattat gcgaagagat 2521 tcattctaaa acaaagtcaa gaaaacaact gggtcggctc atgctaaaag gagataatat 2581 tcctctgctg caaagtgtct ccaactagaa atgatcaatg aagtgagaaa ttgttgagaa 2641 ggatacagtt tgtttttaga tgtccaatat gaacatttat tcatattgtt ttgattaccc 2701 ttatgttatt acaagatggc aataaatgct gtgggattgt ttgtactaaa aaaaaaaaag 2761 aaagaaaaaa gaaaacatat gtcctcccaa aatttggtac aagaatgttt atgacagcat 2821 gatttataac agtcaaaaag ggaaggaggg ctgggcgttg gtggctacac gtgtaatcct 2881 agcattttgg aaggccaagg tggtggatca cctgaggtca ggagtttaga ggaccagccc 2941 ggccaacatg atgaacatgc ccccacctct actaaaaaca caaaaattag ccgggcttgg 3001 tggtgggcac cttgtaatcc aagatgcttg ggaggctgag gcaggagaat cgcttgaacc 3061 tgggaggcag aggttgcagt gagccaagat ggcgcccatg cactccagcc tggggaccag 3121 agtgagactc tgtctaaaaa ttaataaaaa taaaaaagag aagcaaccca aaagtccact 3181 aactgatgag ttaataaata aaacatgata tatccatatc tataccatgg aatattattt 3241 gccaataaaa agaaagaata attaacgcat gctacaacat aaacaaatca tggaacatta 3301 tgctcagtgc aagaagccag acactaaagg tgtgcattat actctttcat ttgtaggaaa 3361 tacggcaaat ccgtaaagac agaaagtaga tgatcagtgg ttgccagagg ctgggggaag 3421 agggaatggg aagtgaatgc taatgggttg ccgggtttct ttttgaggca gcaaaaatgt 3481 ccttgaatta gatagtgctg atgactgcac acctttgtga atatactaaa aaccactggg 3541 ctgtactttt taaagaggtg aattttatga catatgtaac tatatcccaa ttttaattat 3601 gcaaatatat acaataatgc accacataac agtacttcag tcaatgtcag accacatgta 3661 ccatggccat ccaatgagat tataatgaag ctgaaaagtt cctgtctcct ggtggtgatg 3721 ctatagctgt catcatggca ttaacccaac acattactca cgtgtttaat gtaaacaaac 3781 ctactgcgct accagttgta taaaaatcta gcacatagaa ttatgaatag tacataatac 3841 ttgataatcg ataataaatg actattatca taatatagta tgttgtacta atttatgtac 3901 aattcaatat atgtttatac gtattttaga 
gtatacttcc tttccaacct tttttttttt 3961 tttgagacgg agcctcgctc tgtcacccaa gactggaggg cagtggtgca atctcggctc 4021 cctgcaacct ctggcctccc aggttgcaag cgattctcct gcctcagcct cccaagcagc 4081 tgggatttta ggcacccgcc accaggccca gctaattttt gtgtttttac tagagaaagg 4141 gtttcaccag gttggccaag ttggtctcaa actcctgacc tcaggtgatc cacccgcctc 4201 gggcctccca aagtgctggg attacaggcc tgagccaccg caccctgcct ccttctactt 4261 attaaaaagc aaaagttaac tataaaacag ccccaggcag gtccatcagg agatatttca 4321 gaggaaggca ttgctatcat accagataga cgtccatgtg tgtttctgcc cctgaagacc 4381 ttccagtggg acaagatgtg gagatgcaag acagtgatat taatgatcct gaccctgggt 4441 gggcctagga taatgtgtgt gtgttttagc ttttaacaaa aaagtcaaat ttttaaaaat 4501 ttttaaatag aaaaaaaacg tatagaataa ggatataaaa tatttttgta gagctgtaca 4561 ttgtgtctgt gtttaagctg ttattacaaa agtcaaagtt gaaaatagaa gtttattagt 4621 aaaatgttac atagctaagg taatttattg aagcagaaat tttttttaat taaatttgaa 4681 accatagctg aatgtgacag tgttacaaag tctacaggaa tgtacactaa tatctaagcc 4741 ttcacactca ctcaccactc actcactcac tcacccaaag caactccagt cctgcaagct 4801 ccattcacgg taagcgtcct agatgggtgt actaggtttt catcttttat gccatattct 4861 tactggacct tttctatatt tatataaaca cttattcata ccattgtgtg acaattgcct 4921 aaagtattca gcaaagtgac atgctgttcg ggtttgaagc caaggagcaa taggctacac 4981 cacatagccc aggtgtgtag cggctctgcc aaagctctct atgactgctc tgggacggcc 5041 tcaccatggc atcacctaat gatggatttc tgaccacgga tccccgtcgt aagtgatgct 5101 aactatactt acgagtacat tgtgatgagt cattatttac aggtcatttt aaacacccat 5161 acatgattta caggactcta tctcctagaa ctttaatttc ctacagacag tcacaagaaa 5221 aacattaaga tgaacttaga gaacaagcct gagtgctttc tccttgcatt tcataacaaa 5281 aggcaatgtt cataggggga tttcagatag tgtgttgatt tagggagaaa aatcacagtg 5341 ctaaagaatt c //
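As a quick consistency check on the entry above, the four values on its BASE COUNT line (1826 a, 1025 c, 1050 g, 1450 t) should add up to the 5351 bp stated on the LOCUS line. The following is a minimal Python sketch of that check, not part of the GenBank distribution; the helper name `base_count_total` is hypothetical and the parsing assumes the simple "count letter" layout shown in this entry.

```python
import re


def base_count_total(base_count_line):
    """Sum the per-base counts found on a GenBank BASE COUNT line.

    Assumes the layout used in this entry: an integer followed by
    one of the letters a, c, g or t.
    """
    pairs = re.findall(r"(\d+)\s+([acgt])\b", base_count_line)
    return sum(int(count) for count, _base in pairs)


# Values copied from the entry above.
line = "BASE COUNT 1826 a 1025 c 1050 g 1450 t"
locus_length = 5351

total = base_count_total(line)
print(total, total == locus_length)
```

The same check can be repeated against the letters actually present in the ORIGIN block once the position numbers and spaces are stripped.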
GenBank-Updates@genbank.bio.net (05/22/91)
LOCUS       HUMPGEP102   5351 bp ds-DNA             PRI       22-MAY-1991
DEFINITION  Human snRNP E protein pseudogene 110.
ACCESSION   M65235 M25912
KEYWORDS    E protein; pseudogene; small nuclear ribonucleoprotein; snRNP.
SEGMENT     2 of 2
SOURCE      Human fetal liver DNA, clone 110.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 5351)
  AUTHORS   Stanford,D.R., Holicky,E.L., Perry,C.A., Rehder,K.J., Harvey,S.E.,
            Rohleder,A.M. and Wieben,E.D.
  TITLE     The snRNP E protein multigene family contains five pseudogenes
            with common mutations
  JOURNAL   DNA Sequence (1991) In press
  STANDARD  full staff_entry
FEATURES             Location/Qualifiers
     repeat_unit     604..1087
                     /rpt_family="KpnI"
     repeat_unit     1607..2305
                     /rpt_family="KpnI"
     repeat_unit     2297..2313
                     /rpt_type=direct
     mRNA            2313..2775
                     /pseudo=gene
     CDS             2331..2432
                     /pseudo=gene
                     /codon_start=2331
     polyA_signal    2721..2726
     polyA_site      2747..2747
     repeat_unit     2771..2787
                     /rpt_type=direct
     misc_feature    2783..2846
                     /note="KpnI-like region"
     repeat_unit     2835..2845
                     /rpt_type=direct
     repeat_unit     2849..3157
                     /rpt_family="Alu"
     repeat_unit     3153..3163
                     /rpt_type=direct
     repeat_unit     3166..3900
                     /rpt_family="KpnI"
     repeat_unit     3938..3950
                     /rpt_type=direct
     repeat_unit     3950..4249
                     /rpt_family="Alu"
     repeat_unit     4250..4260
                     /rpt_type=direct
BASE COUNT     1826 a   1025 c   1050 g   1450 t
ORIGIN      About 900 bp after segment 1.
1 ccatggccgt ggtgacaggc agccacccac agccaccgca gggacgggag gggagcagtt 61 atgggctggt tacaacttat ggtcacagaa aataaaacgg ggagacgtcc agctctgggt 121 ggaaaagccc catcttttta agcctggaaa ctatgccatg tctccacgtg cgtattaaag 181 atttcccaat tcggaagtca ctgagcacca gtccttctaa tagagcctca gatagatagt 241 ctgtttcttg aagatcaagt ttgaaacttc actctctgga gaaaagaaca ggatatacat 301 gaaatccgca gaaacacaaa gaaccatcct tcaatgtaaa cttaaaaatt acaaaataga 361 agtaaaatag gaagtaactt ttccctccct tcactcaacc accattcacc tgaaatcctt 421 gccgaaacac ttcaaagtgt gtgggggggc tggaggcaag ttgcccggat gcgggaccct 481 catgcactat gaagggagtg caaattgcta cagtcccttc cagaagcgac aagacaatat 541 atatattctc taaatattct agccatttca cccccaaatc cctctgctgg gaacctgccc 601 taaggaagca attcaaagat caaatcgtat gacaaaacat gttcactgtg gtattcccta 661 taaattaaaa gtcaaatgct acagccataa aaaggacgag gtcatgtcct ttgcagggac 721 atgggtgaag ttggaagcca tcatccttag caaactaaca taggaacaga aaaccaaaca 781 ctgcatgttc tcactcataa gtggaagttg aacaatgaga acacatggac acagggcagg 841 ggaacaacac acaccagggg gtgtggggag tggggggaga gagagcatca ggataaatag 901 ctaatgcatg cggggcttaa aacctagatc gatgggttga tatgcgcagc aaaccgccat 961 ggcacatgtt tacctatgta acaaacctgc acattctgca cttgtatccc ggaacttaaa 1021 gtaaaaaaga aataaataaa gtcaaacgca acttaagtat ttaacatctg ggcatcataa 1081 gtaaattatc ttgcatcaaa tcaatataat attatgtagt atttaaaatt gtaattatgc 1141 taagtatagt aacaaagaaa ataacttaca gaaaatatct gtatattgga caaggtcaag 1201 aaggcactcc ccaaaatgaa agtcatttat ttacctttgt atctaacctg tgccttcagc 1261 tttattccag aaattctcag ggaacccttg tacacatgct ttacctagga tgctggaaat 1321 ggggtttatt cctttaaaaa catttttgac aatttattct gagaaaatta tcatgcactc 1381 attgctgtac tgctactatt ttctaaatgg gaatagcata aatgtccaat aataagggat 1441 ttaaaaaata atagataccc aaacaatgga atattgtgct accaatcaaa cttatgttgt 1501 aggagaataa tgacatcaga aaatgtttct gatatattaa atgaagtaaa ttacaaaatc 1561 gatttaaact aggttcgctt ttaaaaggta tatattttta attttaattc cttaatattt 1621 ttcttttata tataataaat gttaattcaa aatggaaaaa gtgttccaga gctgagcctg 1681 ataatttaag aataaaaata 
tccaagggtt aagagctaaa actataaatt ttgaagaaac 1741 atagttgtaa atctttataa ccttggatta ggaaacaaca gacatgtgac aactaaagaa 1801 aaaatattag ataaactgga aagagaaaag acatggaaga aaattttttc aaatcatatt 1861 tctgataagg gcctaatatc ccaaatctat aaagaactct taaaattcaa caataaaaag 1921 acaacaaatc caattcaaaa atgggcaaag gatttgaata gacatgatcc caagctgtac 1981 aaacagtcaa caagtgcatg aataaatgcc caccgtcatt agttattaaa ggaatataaa 2041 gcaaaaacca caaggagata ccccttcaca cccgttagga ccacttaaag aaagaaagat 2101 atggtctatc ttgtaattgt ctgacaagtt tagttaagga agatgtggag aaactagatc 2161 cctcctgcat ttctagtgag aaccggaaaa tgatgcagcc acttaccaaa ccagtttagc 2221 agttcctcaa aagttaaaca taatgttacc atttgactcg agcaattcca ctgctaggta 2281 tgtacccaag agaactgaaa acatatgtcc tcctcttgtg aaattccacc atggtgtacc 2341 atggccaggg ccagaaagtg caaagggtta tggtgcagcc catcaacctc atcttcagat 2401 acttacaaaa tagatcacgg attcaagtgt agctctatga gcaagtgaat atgaggatag 2461 aaggctgtat cattggtttt gatgagcata tgaaccttgt attagattat gcgaagagat 2521 tcattctaaa acaaagtcaa gaaaacaact gggtcggctc atgctaaaag gagataatat 2581 tcctctgctg caaagtgtct ccaactagaa atgatcaatg aagtgagaaa ttgttgagaa 2641 ggatacagtt tgtttttaga tgtccaatat gaacatttat tcatattgtt ttgattaccc 2701 ttatgttatt acaagatggc aataaatgct gtgggattgt ttgtactaaa aaaaaaaaag 2761 aaagaaaaaa gaaaacatat gtcctcccaa aatttggtac aagaatgttt atgacagcat 2821 gatttataac agtcaaaaag ggaaggaggg ctgggcgttg gtggctacac gtgtaatcct 2881 agcattttgg aaggccaagg tggtggatca cctgaggtca ggagtttaga ggaccagccc 2941 ggccaacatg atgaacatgc ccccacctct actaaaaaca caaaaattag ccgggcttgg 3001 tggtgggcac cttgtaatcc aagatgcttg ggaggctgag gcaggagaat cgcttgaacc 3061 tgggaggcag aggttgcagt gagccaagat ggcgcccatg cactccagcc tggggaccag 3121 agtgagactc tgtctaaaaa ttaataaaaa taaaaaagag aagcaaccca aaagtccact 3181 aactgatgag ttaataaata aaacatgata tatccatatc tataccatgg aatattattt 3241 gccaataaaa agaaagaata attaacgcat gctacaacat aaacaaatca tggaacatta 3301 tgctcagtgc aagaagccag acactaaagg tgtgcattat actctttcat ttgtaggaaa 3361 tacggcaaat ccgtaaagac agaaagtaga 
tgatcagtgg ttgccagagg ctgggggaag 3421 agggaatggg aagtgaatgc taatgggttg ccgggtttct ttttgaggca gcaaaaatgt 3481 ccttgaatta gatagtgctg atgactgcac acctttgtga atatactaaa aaccactggg 3541 ctgtactttt taaagaggtg aattttatga catatgtaac tatatcccaa ttttaattat 3601 gcaaatatat acaataatgc accacataac agtacttcag tcaatgtcag accacatgta 3661 ccatggccat ccaatgagat tataatgaag ctgaaaagtt cctgtctcct ggtggtgatg 3721 ctatagctgt catcatggca ttaacccaac acattactca cgtgtttaat gtaaacaaac 3781 ctactgcgct accagttgta taaaaatcta gcacatagaa ttatgaatag tacataatac 3841 ttgataatcg ataataaatg actattatca taatatagta tgttgtacta atttatgtac 3901 aattcaatat atgtttatac gtattttaga gtatacttcc tttccaacct tttttttttt 3961 tttgagacgg agcctcgctc tgtcacccaa gactggaggg cagtggtgca atctcggctc 4021 cctgcaacct ctggcctccc aggttgcaag cgattctcct gcctcagcct cccaagcagc 4081 tgggatttta ggcacccgcc accaggccca gctaattttt gtgtttttac tagagaaagg 4141 gtttcaccag gttggccaag ttggtctcaa actcctgacc tcaggtgatc cacccgcctc 4201 gggcctccca aagtgctggg attacaggcc tgagccaccg caccctgcct ccttctactt 4261 attaaaaagc aaaagttaac tataaaacag ccccaggcag gtccatcagg agatatttca 4321 gaggaaggca ttgctatcat accagataga cgtccatgtg tgtttctgcc cctgaagacc 4381 ttccagtggg acaagatgtg gagatgcaag acagtgatat taatgatcct gaccctgggt 4441 gggcctagga taatgtgtgt gtgttttagc ttttaacaaa aaagtcaaat ttttaaaaat 4501 ttttaaatag aaaaaaaacg tatagaataa ggatataaaa tatttttgta gagctgtaca 4561 ttgtgtctgt gtttaagctg ttattacaaa agtcaaagtt gaaaatagaa gtttattagt 4621 aaaatgttac atagctaagg taatttattg aagcagaaat tttttttaat taaatttgaa 4681 accatagctg aatgtgacag tgttacaaag tctacaggaa tgtacactaa tatctaagcc 4741 ttcacactca ctcaccactc actcactcac tcacccaaag caactccagt cctgcaagct 4801 ccattcacgg taagcgtcct agatgggtgt actaggtttt catcttttat gccatattct 4861 tactggacct tttctatatt tatataaaca cttattcata ccattgtgtg acaattgcct 4921 aaagtattca gcaaagtgac atgctgttcg ggtttgaagc caaggagcaa taggctacac 4981 cacatagccc aggtgtgtag cggctctgcc aaagctctct atgactgctc tgggacggcc 5041 tcaccatggc atcacctaat gatggatttc tgaccacgga 
tccccgtcgt aagtgatgct 5101 aactatactt acgagtacat tgtgatgagt cattatttac aggtcatttt aaacacccat 5161 acatgattta caggactcta tctcctagaa ctttaatttc ctacagacag tcacaagaaa 5221 aacattaaga tgaacttaga gaacaagcct gagtgctttc tccttgcatt tcataacaaa 5281 aggcaatgtt cataggggga tttcagatag tgtgttgatt tagggagaaa aatcacagtg 5341 ctaaagaatt c //
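The two entries differ mainly in how the 2313..2775 region is annotated: the 05/17 entry lists it as a CDS, while the 05/22 entry lists it as mRNA and adds a shorter CDS at 2331..2432. Feature locations of the form `start..end` are inclusive at both ends, so a span's length is `end - start + 1`. The Python sketch below works through that arithmetic; the helper name `span_length` is hypothetical, and it assumes only the simple `start..end` locations that appear in these entries (no `join(...)`, `complement(...)` or partial-end markers).

```python
def span_length(location):
    """Length in bp of a simple inclusive 'start..end' feature location."""
    start, end = (int(part) for part in location.split(".."))
    return end - start + 1


# Locations copied from the FEATURES tables of the two entries above.
features = [
    ("CDS, 05/17 entry", "2313..2775"),
    ("mRNA, 05/22 entry", "2313..2775"),
    ("CDS, 05/22 entry", "2331..2432"),
    ("polyA_site", "2747..2747"),
]

for name, location in features:
    length = span_length(location)
    # A remainder of 0 means the span divides into whole codons.
    print(f"{name:18s} {location:12s} {length:4d} bp, remainder mod 3 = {length % 3}")
```

By this arithmetic the 2313..2775 span is 463 bp, which is not a multiple of three, whereas 2331..2432 is 102 bp, or 34 whole codons.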