[bionet.molbio.genbank.updates] Pea legA gene for legumin

GenBank-Updates@genbank.bio.net (05/26/91)

LOCUS       PEALEGAG     3347 bp ds-DNA             PLN       26-MAY-1991
DEFINITION  Pea legA gene for legumin
ACCESSION   X02982 X00634
KEYWORDS    legumin; seed storage protein; storage protein.
SOURCE      Pisum sativum DNA.
  ORGANISM  Pisum sativum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
            Rosidae; Rosales; Fabaceae.
REFERENCE   1  (bases 1037 to 3347)
  AUTHORS   Lycett,G.W., Croy,R.R., Shirsat,A.H. and Boulter,D.
  TITLE     The complete nucleotide sequence of a legumin gene from pea (Pisum
            sativum L.)
  JOURNAL   Nucleic Acids Res. 12, 4493-4506 (1984)
  STANDARD  full automatic
REFERENCE   2  (bases 1 to 1249)
  AUTHORS   Lycett,G.W., Croy,R.R., Shirsat,A.H., Richards,D.M. and Boulter,D.
  TITLE     The 5'-flanking regions of three legumin genes: comparison of the
            DNA sequences
  JOURNAL   Nucleic Acids Res. 13, 6733-6743 (1985)
  STANDARD  full automatic
COMMENT     EPD; 14005; Ps legumin legA. SWISS-PROT; P02857; LEGA$PEA.
            
            From EMBL    entry PSLEGAG;  dated 06-JUL-1989.
FEATURES             Location/Qualifiers
     repeat_region   180..202
                     /note="imp. direct repeat 1"
     repeat_region   214..236
                     /note="imp. direct repeat 1"
     repeat_region   686..690
                     /note="direct repeat 2"
     repeat_region   800..815
                     /note="direct repeat 3"
     repeat_region   821..837
                     /note="direct repeat 3"
     repeat_region   833..837
                     /note="direct repeat 2"
     repeat_region   904..908
                     /note="direct repeat 4"
     repeat_region   914..918
                     /note="direct repeat 4"
     misc_feature    1047..1056
                     /note="sequence homologous to adenovirus enhancer"
     misc_feature    1060..1067
                     /note="sequence homologous to SV40 enhancer"
     RBS             1099..1114
                     /note="put. rRNA binding region"
     promoter        1115..1120
                     /note="put. CAAT-box"
     promoter        1170..1180
                     /note="put. TATA-box"
     misc_feature    1204..1204
                     /note="transcription start site"
     CDS             1238..1523
                     /note="legumin alpha part 1"
                     /codon_start=1238
     intron          1524..1611
                     /note="intron I"
     CDS             1612..1862
                     /note="legumin alpha part 2"
                     /codon_start=1612
     intron          1863..1950
                     /note="intron II"
     CDS             1951..2409
                     /note="legumin alpha part 3"
                     /codon_start=1951
     CDS             2410..2577
                     /note="legumin beta part 1"
                     /codon_start=2410
     intron          2578..2706
                     /note="intron III"
     CDS             2707..3063
                     /note="legumin beta part 2"
                     /codon_start=2707
     misc_feature    3095..3102
                     /note="pot. polyA signal"
     misc_feature    3172..3180
                     /note="pot. polyA signal"
     misc_feature    3186..3197
                     /note="pot. polyA signal"
     repeat_unit     3250..3260
                     /note="inverted repeat"
     repeat_unit     3332..3343
                     /note="inverted repeat"
BASE COUNT     1140 a    612 c    649 g    946 t
ORIGIN
        1 ggatccttta gaattatttt tttaggtctc aatagattaa gaagttggcg tctcattgat
       61 tgaccatgga caatttgaaa gaaaaaaaag atcacctttg ttttttagag gaaaaaggaa
      121 gcaattaagt agagaaaaca aaaagaataa atggaagaag ttgaggaaat ctatatttac
      181 acgatcaatt agtatgtgtt aagagtcatg tatcatgatc aattagtatg tgttaaagtc
      241 ttgtatcaga taatatataa tccaaatata tttttctaaa tgaggacaaa tctaacctta
      301 caaataagtt ttttagagtt aaattagatt caatcacatt ttatttttta ttttttgaac
      361 agtaagaaat aagatctata ttttcttctc tatttgttta cgtccataca aaaaatgtgc
      421 aatgattgtg aaagatgtca tgcatatgca gtcaccatat attatttaca taaaaagaac
      481 tacttattct ttcggcctca aattttacct aggaattatg tatgcaaata tgaaatattc
      541 atggactttt ttcgtccatt ctttctctgg aaattactcc ctatgtttat agaatttgaa
      601 aacttttgag taaattagca ctttaaatgt aaaagtatgg catcttatca aacaaccggt
      661 tgatgaaaat ttcacatttt caggtagtaa tatgaaattg ataatggaaa gatgatatag
      721 tattaataat aaatatattt gaaaagataa caataaatgt attatatcta taaatttaca
      781 aggttcttat atttacatac aaacaaatgc agtaatgttt caaacaatat gcagtaagta
      841 attaacactt taatttgaag gattaatcaa tttggtaact gaagtagcta attgaaagtt
      901 tattctttat aaatctttgt aatgcagaat atgtaagaaa gaaacatgga gtataagaag
      961 taaagccatg gtcccctgcc accgatttca gctataagaa ttgcaagtat gctctttgtc
     1021 tggtaatgga gatgatgaag ccattagcca cctcctctat cagacatagg tgtaaagcat
     1081 tatgcttcca tagccatgca agctgcagaa tgtccaattc tcaacatccc actttcaatg
     1141 acgtgtccaa ccttcaccac cctctcttct ctataaatta ccacttctca ttaaggttct
     1201 ccgcatcaca accaacattc tcttagtatc tctcttcatg gctaagcttc ttgcactttc
     1261 tctttcattc tgttttctac ttttgggtgg ctgttttgct ttgagagaac agccacagca
     1321 aaatgagtgc cagctagaac gcctcgatgc cctcgagcct gataaccgta tagaatcgga
     1381 aggtgggctc attgagactt ggaatcccaa caacaagcaa ttccgatgtg ctggtgtggc
     1441 cctctctcgt gctacccttc aacgcaacgc ccttcgcaga ccttactact ccaatgctcc
     1501 ccaagaaatt ttcatccaac aaggttactt attttgatct tataccaact tctttacgta
     1561 cattacatgc atattagcat actaattagt gttctactat accaattaca ggtaatggat
     1621 attttggcat ggtattcccc ggttgtcctg agacctttga agagccacaa gaatctgaac
     1681 aaggagaggg acgcaggtac agagacagac atcaaaaggt taaccgattc agagagggtg
     1741 atatcattgc agttcctact ggtattgtat tttggatgta caacgaccaa gacactccag
     1801 ttattgccgt ctctcttact gacattagaa gctccaataa ccagcttgat cagatgccta
     1861 gggtgagcac tgagcataat taaacttccc atataagata atatgttgtc caaaacagta
     1921 acatagattc tatctatcta tgtttgacag agattctatc ttgctgggaa ccacgagcaa
     1981 gagtttctac aataccagca tcaacaagga ggaaagcaag aacaagaaaa tgaaggcaac
     2041 aacattttca gtggcttcaa gagggattac ttggaagatg ctttcaacgt gaacaggcat
     2101 atagtagaca gacttcaagg caggaatgaa gacgaagaga agggagccat tgtcaaagtg
     2161 aaaggtggac tcagcatcat aagcccaccc gagaagcaag cgcgccacca gagaggcagc
     2221 agacaagagg aagatgaaga tgaagagaag cagccgcgcc accagagagg cagcagacaa
     2281 gaggaagagg aagatgaaga tgaagagagg cagccgcgtc atcaaaggag aagaggagag
     2341 gaggaagaag aagacaagaa agagcgcggc ggcagccaaa aaggcaaaag cagaaggcaa
     2401 ggagacaatg ggcttgagga aacagtttgc actgctaaac ttcgattgaa cattggcccg
     2461 tcttcatcac cagacatcta caaccctgaa gctggtagaa tcaaaactgt taccagcctg
     2521 gacctcccag ttctcaggtg gctcaaacta agtgctgagc atggatctct ccacaaagta
     2581 tgttttttca tcatttaatt tgtttttcca tgaatcaatt tcatgtcgaa ctatgtgttg
     2641 gagaataata gctaactcat tacaatcttc atacagaatg ctatgtttgt gcctcactac
     2701 aacctgaatg caaacagtat aatatacgca ttgaagggac gtgcaaggct acaagtagtg
     2761 aactgcaatg gcaacaccgt gtttgatgga gagctagaag ccggacgtgc attgacagtg
     2821 ccacaaaact atgctgtggc tgcaaagtca ctaagcgaca ggttctcata tgtagcattc
     2881 aagaccaatg atagagctgg tattgcaaga cttgcaggga catcatcagt tataaataat
     2941 ctgccgttgg atgttgttgc agctacattc aacctgcaga ggaatgaggc aaggcagctc
     3001 aagtccaaca atcccttcaa atttctagtt ccagctcgtg agtctgagaa cagagcttcg
     3061 gcttagattt ggcaccaaat caatgaaagt aatgaataag aaaactaagg cttagatgcc
     3121 tttgttactt gtgtaaaata actcgagtca tgtacctttt agcggaaaca gaataaataa
     3181 aaggtaaaat ttcagtgctc tatgcttttc tactccaagt tataaccaga tgatatatat
     3241 aacaatcaca ataaataaat gtgagtaaaa aaatattgaa gaaaaatgat gtattgaaat
     3301 tataactagc cggattggat ttaaggagtt acaatgaaat tttttga
//
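
The CDS and intron coordinates in the FEATURES table above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the GenBank record: the coordinates are copied by hand from the feature table, and the names (cds_segments, segment_length, total_cds_length, gaps_between) are hypothetical helpers introduced only for illustration. It assumes the five CDS segments are meant to be read as one joined coding region and that coordinates are 1-based and inclusive.

```python
# CDS segments copied from the FEATURES table (1-based, inclusive coordinates).
# These tuples are an illustrative transcription, not parsed from the file.
cds_segments = [
    ("legumin alpha part 1", 1238, 1523),
    ("legumin alpha part 2", 1612, 1862),
    ("legumin alpha part 3", 1951, 2409),
    ("legumin beta part 1", 2410, 2577),
    ("legumin beta part 2", 2707, 3063),
]


def segment_length(start, end):
    """Length in bases of an inclusive 1-based interval."""
    return end - start + 1


def total_cds_length(segments):
    """Sum of the lengths of all CDS segments."""
    return sum(segment_length(start, end) for _, start, end in segments)


def gaps_between(segments):
    """Return (start, end) of each gap between consecutive segments.

    Segments that directly abut (such as alpha part 3 and beta part 1)
    produce no gap.
    """
    gaps = []
    for (_, _, prev_end), (_, next_start, _) in zip(segments, segments[1:]):
        if next_start > prev_end + 1:
            gaps.append((prev_end + 1, next_start - 1))
    return gaps


if __name__ == "__main__":
    total = total_cds_length(cds_segments)
    print("joined CDS length:", total)            # 1521
    print("whole codons:", total // 3, "remainder:", total % 3)  # 507, 0
    print("gaps between segments:", gaps_between(cds_segments))
```

Under these assumptions the joined coding region is 1521 bases, a whole number of codons (507), and the three gaps between segments coincide with the annotated introns I, II and III (1524..1611, 1863..1950, 2578..2706).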