[bionet.molbio.genbank.updates] Phaseolus vulgaris gene for alpha-phaseolin

GenBank-Updates@genbank.bio.net (05/26/91)

LOCUS       PHVAPHASE    4764 bp ds-DNA             PLN       26-MAY-1991
DEFINITION  Phaseolus vulgaris gene for alpha-phaseolin
ACCESSION   X52626
KEYWORDS    alpha-phaseolin; glycoprotein; phaseolin; seed storage protein.
SOURCE      Phaseolus vulgaris DNA.
  ORGANISM  Phaseolus vulgaris
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
            Rosidae; Rosales; Fabaceae.
REFERENCE   1  (bases 1 to 4764)
  AUTHORS   Anthony,J.L.
  JOURNAL   Unpublished (1990)
  STANDARD  full automatic
REFERENCE   2  (bases 1987 to 4764)
  AUTHORS   Anthony,J.L., Vonder Haar,R.A. and Hall,T.C.
  TITLE     Nucleotide sequence of an alpha-phaseolin gene from Phaseolus
            vulgaris
  JOURNAL   Nucleic Acids Res. 18, 3396-3396 (1990)
  STANDARD  full automatic
COMMENT     *source: strain=Sanilac; tissue=leaf; clone=9C;
            
            From EMBL    entry PVAPHASE;  dated 18-JUN-1990.
FEATURES             Location/Qualifiers
     promoter        2196..2207
                     /note="put. CAAT box"
     promoter        2232..2246
                     /note="put. TATA box"
     precursor_RNA   2275..>2274
                     /note="primary transcript"
     mRNA            2275..2664
                     /note="exon 1"
     CDS             2352..2664
                     /note="precursor polypeptide (AA -22 to 82) (2664 is 1st
                     base in codon)"
                     /codon_start=2352
     CDS             2352..2417
                     /note="signal peptide (AA -22 to -1)"
                     /codon_start=2352
     CDS             2418..2664
                     /note="alpha-phaseolin (AA 1-82)"
                     /codon_start=2418
     intron          2665..2736
                     /note="intron I"
     mRNA            2737..2927
                     /note="exon 2"
     CDS             2737..2927
                     /note="alpha-phaseolin (AA 83-146) (2737 is 2nd base in
                     codon)"
                     /codon_start=2737
     intron          2928..3015
                     /note="intron II"
     mRNA            3016..3096
                     /note="exon 3"
     CDS             3016..3096
                     /note="alpha-phaseolin (AA 147-173)"
                     /codon_start=3016
     intron          3097..3220
                     /note="intron III"
     mRNA            3221..3451
                     /note="exon 4"
     CDS             3221..3451
                     /note="alpha-phaseolin (AA 174-250)"
                     /codon_start=3221
     intron          3452..3579
                     /note="intron IV"
     mRNA            3580..3838
                     /note="exon 5"
     CDS             3580..3838
                     /note="alpha-phaseolin (AA 251-336) (3838 is 1st base in
                     codon)"
                     /codon_start=3580
     intron          3839..3941
                     /note="intron V"
     mRNA            3942..>3941
                     /note="exon 6"
     CDS             3942..4156
                     /note="alpha-phaseolin (AA 337-408) (3942 is 2nd base in
                     codon)"
                     /codon_start=3942
     misc_feature    4084..4137
                     /note="repeat region"
BASE COUNT     1648 a    897 c    782 g   1437 t
ORIGIN
        1 ttaacccact ttctccccac tttctcccca cctcatgccc ctccccaacc aagatgaaca
       61 ctcacttgac atttccatga aaagattccc ttggtctgtg caggaagtgt aatgtcgatc
      121 accacatatt cctactgaaa tcaccacata ccgccactct ataacttgag tgaattttat
      181 ggttagttca cactagccta ctatcatggc cctataaaga cctcatttaa cccactttct
      241 ccccacctca tgccctccaa aaccaacacg gacactcact tgacatttca gaaaaaattt
      301 ccctgggtat gtgcaggaag tctaatgtgg gtcaccatat attcctacta tggtcaccat
      361 atattgtcac cctttaagtt caatgaattt tttgcttact caacaatgcc ctacactctt
      421 ggcctataaa aaccttatgt aactcacttt ctctctacat catgccccac ccaaccaaaa
      481 tagcttcaga aaatgtattt tttttgaaga aaaaaaaaac taaaaaagca tacatagtta
      541 cacaaactaa gtaaatgcaa atatatattt ttaacaatat attcaaaaaa aaaaacaagt
      601 catgctccat aaaaaactaa atagtaaact cagataacta taaccaaatc acatttaaac
      661 acagacttta actcatacaa gggtcaacac atgacaaata acacagatca cccaatacaa
      721 agcacaatat gaatgtttca cttaataaag aaatgaatag aaaatgtatt acctttggtc
      781 accaatccct aaccccacgg tgagagagaa ggtggaggaa aaaccttgag aagaagatgg
      841 aagaggacga cacagacaag aacatgaaga aaaatgaaga agaataaaga agaatgcgca
      901 cacacttgca acatcccatg gagaattaaa gggaacaaaa acggaacgaa caaaaaatga
      961 atggtaagaa cacaggaaaa aggaaatagg gacgtgaaga agaaaagtga agaagtgccc
     1021 caatttctta gctctaattt cttcacattc tgaccctaaa ataaaatgtt aaatctattc
     1081 aacacgtaac ccacaactgc aataaaataa ttttgattgt cacataggcg attatatggt
     1141 gaacaagtgc aggaccgatt gttcaaccat cattttttgt ataaataatg atgaagatct
     1201 cattacaata gcattatatt catattaaag aataggaaaa aattcatata tacttagcac
     1261 tatattataa taatataata atatttttaa ttaaaaaata aattaaatat gattaaaata
     1321 ttattttatt gatttgtgaa atagattttt tttattgatt gtggtgtgaa aaccatcacc
     1381 gtcctaaaga agaaagaata taagagtggt aagctgagtc gactagtctt ttaacgaggt
     1441 gttggacact ttggtaagta cgtcttttgg atatacatgg attctgtgat cttgaacagc
     1501 tggttgataa agaaaaggaa gtgaaggtga taacggagga gagatccgct atacacaagt
     1561 gataagaaca aattactgac atcaaaactt ggaaaattgc aaacgaatgc cattcaaaac
     1621 ccacatggga cataattctt acatacaatt acattcattt aacatcttaa aaaaaaatca
     1681 tttaacatct taaaaaaatt attatattgt tttattataa taggatattt tgttctatta
     1741 aaattttatg gaatatatgt aattcttaag aataatgtta agagtaaatt tttcaaaatt
     1801 tatttgacaa aaaaaatggt aaaacatata ctattaagtt gatttttcgt aaaatagaga
     1861 caaatgcaac cgcacccttc ttcaatcaca caaatttggt tcaggttgtc atggcactct
     1921 gtagtcgttt ggttcatgca tgggtcttac gcaagaaaaa gacaaagaaa aaagccaaaa
     1981 cagagagatc gccgcgtcca tgtatgtcta aatgccatgc acagcaacac gtgcttaaca
     2041 tgcactttaa atggctcacc atctcaaccc acacacaaac acaatgcctt tttcttcatc
     2101 atcaccacaa ccacctgtat atattcattc tcttccgcca cctcaatttc ttcacttcaa
     2161 cacacgtgaa cctgcatatg cgtgtcatcc catgcccaaa tctccatgca tgttccaacc
     2221 accttctctc ttatataata cctataaata cccctaatat cactcacttc tttcatcatc
     2281 catccatcca gagtactact actctactac tataataccc caacccaact catattcaat
     2341 actactctac tatgatgaga gcaagggttc cactcctgtt gctgggaatt cttttcctgg
     2401 catcactttc tgcctcattt gccacttcac tccgggagga ggaagagagc caagataacc
     2461 ccttctactt caactctgac aactcctgga acactctatt caaaaaccaa tatggtcaca
     2521 ttcgtgtcct ccagaggttc gaccaacaat ccaaacgact tcagaatctt gaagactacc
     2581 gtcttgtgga gttcaggtcc aaacccgaaa ccctccttct tcctcagcag gctgatgctg
     2641 agttactcct agttgtccgt agtggtaagt aattgctact ggtatcactt gtttcttctt
     2701 gcagaaataa tggtaatgag ttttttataa tttcagggag cgccatactc gtcttggtga
     2761 aacctgatga tcgcagagag tacttcttcc ttactagcga taacccgata ttctctgatc
     2821 accagaaaat ccctgcagga accattttct atttggttaa ccctgatccc aaagaggatc
     2881 tcagaataat ccaactcgcc atgcccgtta acaaccctca gattcatgta ctgccttttg
     2941 taatactgaa ctaatttttt gttattttaa cttgcaattt ctctccaaat gtgatgataa
     3001 atgtttgtcc tgcaggactt tttcctatct agcacagaag cccaacaatc ctacttgcaa
     3061 gagttcagca agcatattct agaggcctcc ttcaatgtaa gaaagaaaac agcatctaac
     3121 tacatatttg cgttgccatt tagctagtac tttgtctaaa tgtcacactt gttgaatttg
     3181 ttgaatgata tcattatata tgtttgcatg atttttatag agcaaattcg aggagatcaa
     3241 cagggttctg tttgcagagg agggacagca agagggagtg attgtgaaca ttgattctga
     3301 acagattgag gaactgagca aacatgcaaa atctagttca aggaaatccc tttccaaaca
     3361 agataacaca attggaaacg aatttggaaa cctgactgag aggaccgata actccttgaa
     3421 tgtgttaatc agttctatgg agatgaaaga ggtaaataca aagaaaaacc atatagacaa
     3481 actcagcaat tgagttctat tattcactgt cgtcttggtt agaaaatctt agtattgaga
     3541 ctataattaa ataatggttt tttttgttaa caaatttagg gagctctttt tgtgccacac
     3601 tactattcta aggccattgt tatactagtg gttaatgaag gagaagcaca tgttgaactt
     3661 gttggcccaa aaggaaataa ggaaaccttg gaatatgaga gctacagagc tgagctttct
     3721 aaagacgatg tatttgtaat cccagcagca tatccagttg ccatcaaggc tacctccaac
     3781 gtgaatttca ctggtttcgg tatcaatgct aataacaaca ataggaacct ccttgcaggt
     3841 atatatattt attatatatg accatgaatt tgaatatagg gttgttgatg ggatttttta
     3901 tttataattg gtaatgcgtg attgtgattg aaaatatgaa ggtaagacgg acaatgtcat
     3961 aagcagcatc ggtagagctc tggacggtaa agacgtgttg gggcttacgt tctctgggtc
     4021 tggtgaagaa gttatgaagc tgatcaacaa gcagagtgga tcgtactttg tggatggaca
     4081 ccatcaccaa caggaacagc aaaagggaag tcaccaacag gaacagcaaa agggaagaaa
     4141 gggtgcattt gtgtactgaa taagtatgaa ctaaaatgca tgtatggtgt aagagctcat
     4201 ggagagcatg gaaatatgta tccgaccatg taacactata ataactgagc tccatctcac
     4261 ttcttctatg aataaacaaa ggatgttatg atatattaac actatatgca ccttcactag
     4321 taatacatta atatttaata ctttttattt taacttttta gtttaaaata ttattatatt
     4381 attaactttt tagtttaaaa tatttatatt attataaaga gaaataaaca aaggatgtta
     4441 tgatatatta acactatatg taccttacat agtaatatat taatatttaa tactttttat
     4501 tttaactttt taatttaaaa tattattata aatgacgctt gtgttttatg tgttggcatg
     4561 cttgtatttt atgtgttgac tttctgtgtg aaggtaatgt gatatggtga gctggtggta
     4621 acaattgtgt tttatgtgtt ggctttctgt gaagctaatt tgatatggtt agctgatgtg
     4681 aacaaaatat taaaggaagc taatttgata tggttagccg atagtaacaa aatatcaaaa
     4741 taaatttctt cttactttaa taaa
//