[bionet.molbio.genbank.updates] Human PRB4 gene for proline-rich protein Po, allele S

GenBank-Updates@genbank.bio.net (05/30/91)

LOCUS       HUMPRB4S     5813 bp ds-DNA             PRI       30-MAY-1991
DEFINITION  Human PRB4 gene for proline-rich protein Po, allele S
ACCESSION   X07882
KEYWORDS    PRB4 gene; proline-rich protein; protein Po.
SOURCE      Homo sapiens DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 5813)
  AUTHORS   Lyons,K.M.
  JOURNAL   Unpublished (1988)
  STANDARD  full automatic
REFERENCE   2  (bases 1934 to 2798)
  AUTHORS   Lyons,K.M., Stein,J.H. and Smithies,O.
  TITLE     Length polymorphisms in human proline-rich protein genes generated
            by intragenic unequal crossing over
  JOURNAL   Genetics 120, 267-278 (1988)
  STANDARD  full automatic
COMMENT     **map: chromosomal location=chromosome 12p13.2;
            
            From EMBL    entry HSPRB4S;  dated 24-JUL-1990.
FEATURES             Location/Qualifiers
     promoter        564..569
                     /note="TATA-box"
     precursor_RNA   596..3951
                     /note="primary transcript"
     mRNA            596..692
                     /note="Exon 1"
     CDS             629..692
                     /note="Po protein (AA 1 - 21) (692 is 1st base in codon)"
                     /codon_start=629
     intron          693..1621
                     /note="Intron I"
     mRNA            1622..1657
                     /note="Exon 2"
     CDS             1622..1657
                     /note="Po protein (AA 22 - 33) (1622 is 2nd base in codon)
                     (1657 is 1st base in codon)"
                     /codon_start=1622
     intron          1658..2212
                     /note="Intron II"
     mRNA            2213..2811
                     /note="Exon 3"
     CDS             2213..2790
                     /note="Po protein (AA 34 - 226) (2213 is 2nd base in
                     codon)"
                     /codon_start=2213
     intron          2812..3834
                     /note="Intron III"
     mRNA            3835..3951
                     /note="Exon 4"
     misc_feature    3933..3938
                     /note="polyA signal"
     polyA_site      3951..3951
                     /note="polyA site"
BASE COUNT     1647 a   1428 c   1094 g   1644 t
ORIGIN
        1 aagcttgaac aatagttgta gcccgagcag gctgtggggc agctttctgt catctatgtg
       61 tgcccaggag tgtgttgtct tcacaccatc acacaggtaa cagaaaccat ctctatcgca
      121 tcaaaacttt agccaattgt gatactatct gtacacacac atcacaagag atggaaatag
      181 gctctataaa gagccagaga aattgaccat cttcactgct ggctgtagct tcactgcccc
      241 cacaaatatt acacagaaat aaatagtaga gtggaaagca gacatagtct taccaaaagg
      301 attgaaactc taatgatgtc tgaagttcaa ttatgacaca gtgctgatag cttggacaca
      361 gttcctttag aacctttatt caagtcatag tcgtaccttt tagaaataag gtacaaacaa
      421 catccaaccc accccaccct ctgggctaga gtcccaaaga gaaataaggg atacacctga
      481 cctgcagtaa ggaaagcaga acccgatctc tgaggtggtg aggcccaccc agggctcaaa
      541 ggtgccattg ttttgctcct ctttataaag ggagttgcca cgttcctccc agcacagagt
      601 tgggagtgac tccagagcct ccagcgagat gctgctgatt ctgctgtcag tggccctgct
      661 ggccttgagc tcagctgaga gttcaagtga aggtaaaaca gaagggggaa aagatgcggt
      721 gactgcttgg gacttaggag gtgacagtgg taattatggg gaagagagga gaatgaaaac
      781 acagatgggg ctgcagagtt ttcatgccta ggatcaggag acctgttgtg ccctcattcc
      841 acaataagga cttctaattt atttaatgta caatgaaatc caataacgaa tttgttccag
      901 gggaatgaga aggtaagatt tgaatttata gagatagaac tgtgctgtga aggctgcagt
      961 ggagagtgca aggcagattc agggaagtcc agctgtgaag atcctatact gatcccagta
     1021 agtacacagg gatgatggtg gccttgctgt acggacggtc ggcattgatg atagagatac
     1081 atacacatcg gagatactgc ggagacggaa ctggatagaa cacttatctc tgtctaacta
     1141 aagatgtaga aatatcagag ccaatcatta caatttttct ctcccctaca tgcagtattt
     1201 caatgtgctg ggagtggaat gggttagatt gtattgaaat gattacttct ggttacccct
     1261 attgagaaaa catgtgtatg tatgcaatat attaacagga gatggagggc ataagaacac
     1321 caaaatatca cattgaagta cctggcatgc ataaactaaa taagcattaa gtctcgaggg
     1381 atgctaggga ggaaaaaaag gggctgttct atgtcgaact cattgctgtt gctctgtgtg
     1441 gtaacaaccc tgcctcctct tacaccttcc accccttcca gcaccttcac agatggtggc
     1501 tgatgagtta acctagggga tgcatggggt gtggtgagaa gacaattttc cctgtagaac
     1561 acttgtgagt cttgaagatt tgagatgtaa catttcccat catcctgtgc ttctcttcta
     1621 gatgtcagcc aggaagaatc tctcttccta atatcaggta aatcccaatt cattctcaat
     1681 ctgttttgac tccctttttc tgcttacaaa tgggtcattt ctccagtgtc ttcttatcaa
     1741 cactttcctt tcaggaattg attaatgtta ttgcccctaa tgatataggc aatcttcatg
     1801 caaacttgat tctgggacca tgagcaggcc accaaatgga atgtcagaga tgcttgggtt
     1861 ggatgacaac aggagtgggt tgacatcccc ctgggagatg acagacaaat ggccagtgtc
     1921 cttattctga ctccttccta gactgggcct attctcctcc ttagactgag agcccctcaa
     1981 cttctccctt ttcccccagc gttccactcc agagttctag ggcttcactg aaaatgcaaa
     2041 gaaattagta tctgggtctc atttttgtgc atttccccat ttagctccat tactgtaaaa
     2101 atttgtggca actattcagt gaatgccgta tgtcccccac ctcctccagg aaagccagaa
     2161 ggacgacgcc cacaaggagg aaaccagccc caacgtcccc cacctcctcc aggaaagcca
     2221 caaggaccac ccccacaagg aggaaaccag tcccaaggtc ccccacctcc tccaggaaag
     2281 ccagaaggac gacccccaca aggaggcaac cagtcccaag gtcccccacc tcatccagga
     2341 aagccagaaa gaccaccccc acaaggagga aaccagtccc aaggtacccc acctcctcca
     2401 ggaaagccag aaagaccacc cccacaagga ggcaaccagt cccaccgtcc cccacctcct
     2461 ccaggaaagc cagaaagacc acccccacaa ggaggtaacc agtcccaagg tcccccacct
     2521 catccaggaa agccagaagg accaccccca caggaaggaa acaagtcccg aagtgcccga
     2581 tctcctccag gaaagccaca aggaccaccc caacaagaag gcaacaagcc tcaaggtccc
     2641 ccacctcctg gaaagccaca aggcccaccc ccagcaggag gcaatcccca gcagcctcag
     2701 gcacctcctg ctggaaagcc ccaggggcca cctccacctc ctcaaggggg caggccaccc
     2761 agacctgccc agggacaaca gcctccccag taatctagga ttcaatgaca ggtatgattc
     2821 cactttatta ttcatcagga ctctaattgc acagttctcc aactttattg tgccaatgaa
     2881 tcaactaaaa cccattgaca ttgtattgtc ctggaaccca tttctaaaaa tttgtattca
     2941 gatactctgg aatagggtaa ggggaccctg tatttctaac aaaatcgttt aaggaattct
     3001 gatgttgaga aacaacatac catatgatct gtcttaaatt gtgttggcaa tgaggaggta
     3061 gtaccatgtt cattcttggc gttctgtttt ctatccacta actcagagac ctcccattta
     3121 aagttttcac ctgagcacca tttgctcagt cctgcctcac accagcctct cgagtccagt
     3181 attcctgcca aatggtccct gatctttcag cagctaaatg gcgtgtcact ttttagatac
     3241 ttaacttttc aatacgtaca tgattaagct aacaaaaaat atctaatgga atggaaaaat
     3301 atgaagctaa ttttaaaggc ctaacacatc ctaccccacc ttccttcctt caaaaagctc
     3361 ccagtggtta accttatggg atcttttctt tgaaatattt atgtgtgcat agacatatag
     3421 cattctttta ccctaccact aatgccataa cttatatgca ggtatatatg ttagtcattt
     3481 aaaaaataca tttttttaaa atttccacat cagtttatga aggtcactac atatcttcag
     3541 tggttttctg tttgctttta catttttata ctactctatt gtgtagctgt gccatgattt
     3601 cgttaaccaa tccctgtcac tggacactga gggtggtttt agcttctcag tattatagaa
     3661 tatgttccag ttaccatctg tgtaaatata tccctgaaca aattcaacag caatgagtca
     3721 cagcaaccta aggatggtct tttctcttca tcttctaagc cacaatttgg agcacattgt
     3781 gtgcaagggc atcaaaagag tgaatctatg aacttgcttg tttgtttatt tcaggaagtg
     3841 aataagaaga tatcagtgaa ttcaaataat tcaattgcta caaatgccgt gacattggaa
     3901 caaggtcatc atagctctaa ctttaatata ccaataaaat aatcagcttg caatttctga
     3961 ttgtggtgtt ctttctcagt gtttgcggaa tgtggaatgt gaggaccaag aacacattat
     4021 aagaacatct aggacccctt ctgtctgatg cttccaagga gtttcccttc tctttaatcc
     4081 taacttagcc agctgccatg aaaaatgttt tgctgtttat ctctttccct gacttcaatt
     4141 tttttttctt tttctgagat ggagtcttgc tctatcacct aggttggaat gcagtggcgt
     4201 gatcttggct aactgcaacc tgcacctcca gggttcaagc tattctcctg cctcaccctt
     4261 cacagtagct gggattacag gttcccacca tcacaccagg gtaatctttg taattttagt
     4321 tgagatgtgt tttcaccctg ttggcaggct aggctagaaa ttctgacttc aggtcatccg
     4381 cttgccttgg cctccaaatg tgttgggatt acaggcatga gccaccacac ctagctcctt
     4441 tctgacttct acagcacaaa ttgaaaatct aaaattattt tcagattgtt tactgatatt
     4501 ccagtaattt taaggacaaa aaccacaaca aatggaaaat aagtcacaga aactaaaaga
     4561 aatccttata atttctgagt ttggtttcaa gggaacaaac agggttctat gcttcttatt
     4621 cccagagccc tctctatccc attgacccta ttttaacagt gatcacttcc ctccctccct
     4681 atgttcctca cctttcttta atgaaacctg aatggatttc atcaaggagg cagcatgact
     4741 tttaggagca aagaattggg acactctcag attttagtta agacataact ctttcttgct
     4801 agcctgaact cttaaaaagc tacttggtct ctcagagctt caatttcctc atctacaatg
     4861 agaagaatca aaacaactac cttagaatat ggagactatt cagataacat atgtaccaaa
     4921 aaccttgcag agattggcat gtctgcttct caagcaagga aggttcaata ttagaaaact
     4981 gcccctgtgc ccaccgatag cctcagataa ttcactatga atttcagaaa tttcagaata
     5041 gaaggatctc actgtaacca tcaccaagtt gagcaaccca cattcagttc aatcccagtt
     5101 ctctgacttc tctcctatta tcatagttga aggctcccta cccctatctc ttatctttcc
     5161 tcttgattct gaaccacatt acccagtcaa ggattttgct tctgtatgtg acccttttgt
     5221 ctcctgattc ttacatctat gctctatggg attacctcaa tcagcaaaag cctgctgaaa
     5281 catcacccat ttttacagag attttgctaa gactcttagt gtttctttcc taccctatta
     5341 tttgtctgcc atttttattg caaaagttct tgaaacatat gtatgtgatt atttccccat
     5401 cccctccctt ccattttctt tttaaacaca cattaaagat gcttttgttc tttccactcc
     5461 aagtctgtca aggtcatcta ctgcctgcat tccactcatt tcaggaatcg attatcagtc
     5521 ctgcatctcc tgtgacccct tggcagttta acaccattga tcctacaatt cttttgggaa
     5581 cactctatca atctttccgg gaacctccca ctctctcctc gggtttctcc tacctctgcc
     5641 tcttccctcc tgtaactaca aagttactcc tcttccaact ctatttggtc ttggtaaatt
     5701 attcatctaa ttaattaagg aaactaggat ttattctaga ctcttctctt ttcctcttac
     5761 atcacattac atctagtcaa atcagctatg ttatcattgt gaaattcaag ctt
//