GenBank-Updates@genbank.bio.net (05/26/91)
LOCUS PEALEGAG 3347 bp ds-DNA PLN 26-MAY-1991
DEFINITION Pea legA gene for legumin
ACCESSION X02982 X00634
KEYWORDS legumin; seed storage protein; storage protein.
SOURCE Pisum sativum DNA.
ORGANISM Pisum sativum
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Rosidae; Rosales; Fabaceae.
REFERENCE 1 (bases 1037 to 3347)
AUTHORS Lycett,G.W., Croy,R.R., Shirsat,A.H. and Boulter,D.
TITLE The complete nucleotide sequence of a legumin gene from pea (Pisum
sativum L.)
JOURNAL Nucleic Acids Res. 12, 4493-4506 (1984)
STANDARD full automatic
REFERENCE 2 (bases 1 to 1249)
AUTHORS Lycett,G.W., Croy,R.R., Shirsat,A.H., Richards,D.M. and Boulter,D.
TITLE The 5'-flanking regions of three legumin genes: comparison of the
DNA sequences
JOURNAL Nucleic Acids Res. 13, 6733-6743 (1985)
STANDARD full automatic
COMMENT EPD; 14005; Ps legumin legA. SWISS-PROT; P02857; LEGA$PEA.
From EMBL entry PSLEGAG; dated 06-JUL-1989.
FEATURES Location/Qualifiers
repeat_region 180..202
/note="imp. direct repeat 1"
repeat_region 214..236
/note="imp. direct repeat 1"
repeat_region 686..690
/note="direct repeat 2"
repeat_region 800..815
/note="direct repeat 3"
repeat_region 821..837
/note="direct repeat 3"
repeat_region 833..837
/note="direct repeat 2"
repeat_region 904..908
/note="direct repeat 4"
repeat_region 914..918
/note="direct repeat 4"
misc_feature 1047..1056
/note="sequence homologous to adenovirus enhancer"
misc_feature 1060..1067
/note="sequence homologous to SV40 enhancer"
RBS 1099..1114
/note="put. rRNA binding region"
promoter 1115..1120
/note="put. CAAT-box"
promoter 1170..1180
/note="put. TATA-box"
misc_feature 1204..1204
/note="transcription start site"
CDS 1238..1523
/note="legumin alpha part 1"
/codon_start=1238
intron 1524..1611
/note="intron I"
CDS 1612..1862
/note="legumin alpha part 2"
/codon_start=1612
intron 1863..1950
/note="intron II"
CDS 1951..2409
/note="legumin alpha part 3"
/codon_start=1951
CDS 2410..2577
/note="legumin beta part 1"
/codon_start=2410
intron 2578..2706
/note="intron III"
CDS 2707..3063
/note="legumin beta part 2"
/codon_start=2707
misc_feature 3095..3102
/note="pot. polyA signal"
misc_feature 3172..3180
/note="pot. polyA signal"
misc_feature 3186..3197
/note="pot. polyA signal"
repeat_unit 3250..3260
/note="inverted repeat"
repeat_unit 3332..3343
/note="inverted repeat"
BASE COUNT 1140 a 612 c 649 g 946 t
ORIGIN
1 ggatccttta gaattatttt tttaggtctc aatagattaa gaagttggcg tctcattgat
61 tgaccatgga caatttgaaa gaaaaaaaag atcacctttg ttttttagag gaaaaaggaa
121 gcaattaagt agagaaaaca aaaagaataa atggaagaag ttgaggaaat ctatatttac
181 acgatcaatt agtatgtgtt aagagtcatg tatcatgatc aattagtatg tgttaaagtc
241 ttgtatcaga taatatataa tccaaatata tttttctaaa tgaggacaaa tctaacctta
301 caaataagtt ttttagagtt aaattagatt caatcacatt ttatttttta ttttttgaac
361 agtaagaaat aagatctata ttttcttctc tatttgttta cgtccataca aaaaatgtgc
421 aatgattgtg aaagatgtca tgcatatgca gtcaccatat attatttaca taaaaagaac
481 tacttattct ttcggcctca aattttacct aggaattatg tatgcaaata tgaaatattc
541 atggactttt ttcgtccatt ctttctctgg aaattactcc ctatgtttat agaatttgaa
601 aacttttgag taaattagca ctttaaatgt aaaagtatgg catcttatca aacaaccggt
661 tgatgaaaat ttcacatttt caggtagtaa tatgaaattg ataatggaaa gatgatatag
721 tattaataat aaatatattt gaaaagataa caataaatgt attatatcta taaatttaca
781 aggttcttat atttacatac aaacaaatgc agtaatgttt caaacaatat gcagtaagta
841 attaacactt taatttgaag gattaatcaa tttggtaact gaagtagcta attgaaagtt
901 tattctttat aaatctttgt aatgcagaat atgtaagaaa gaaacatgga gtataagaag
961 taaagccatg gtcccctgcc accgatttca gctataagaa ttgcaagtat gctctttgtc
1021 tggtaatgga gatgatgaag ccattagcca cctcctctat cagacatagg tgtaaagcat
1081 tatgcttcca tagccatgca agctgcagaa tgtccaattc tcaacatccc actttcaatg
1141 acgtgtccaa ccttcaccac cctctcttct ctataaatta ccacttctca ttaaggttct
1201 ccgcatcaca accaacattc tcttagtatc tctcttcatg gctaagcttc ttgcactttc
1261 tctttcattc tgttttctac ttttgggtgg ctgttttgct ttgagagaac agccacagca
1321 aaatgagtgc cagctagaac gcctcgatgc cctcgagcct gataaccgta tagaatcgga
1381 aggtgggctc attgagactt ggaatcccaa caacaagcaa ttccgatgtg ctggtgtggc
1441 cctctctcgt gctacccttc aacgcaacgc ccttcgcaga ccttactact ccaatgctcc
1501 ccaagaaatt ttcatccaac aaggttactt attttgatct tataccaact tctttacgta
1561 cattacatgc atattagcat actaattagt gttctactat accaattaca ggtaatggat
1621 attttggcat ggtattcccc ggttgtcctg agacctttga agagccacaa gaatctgaac
1681 aaggagaggg acgcaggtac agagacagac atcaaaaggt taaccgattc agagagggtg
1741 atatcattgc agttcctact ggtattgtat tttggatgta caacgaccaa gacactccag
1801 ttattgccgt ctctcttact gacattagaa gctccaataa ccagcttgat cagatgccta
1861 gggtgagcac tgagcataat taaacttccc atataagata atatgttgtc caaaacagta
1921 acatagattc tatctatcta tgtttgacag agattctatc ttgctgggaa ccacgagcaa
1981 gagtttctac aataccagca tcaacaagga ggaaagcaag aacaagaaaa tgaaggcaac
2041 aacattttca gtggcttcaa gagggattac ttggaagatg ctttcaacgt gaacaggcat
2101 atagtagaca gacttcaagg caggaatgaa gacgaagaga agggagccat tgtcaaagtg
2161 aaaggtggac tcagcatcat aagcccaccc gagaagcaag cgcgccacca gagaggcagc
2221 agacaagagg aagatgaaga tgaagagaag cagccgcgcc accagagagg cagcagacaa
2281 gaggaagagg aagatgaaga tgaagagagg cagccgcgtc atcaaaggag aagaggagag
2341 gaggaagaag aagacaagaa agagcgcggc ggcagccaaa aaggcaaaag cagaaggcaa
2401 ggagacaatg ggcttgagga aacagtttgc actgctaaac ttcgattgaa cattggcccg
2461 tcttcatcac cagacatcta caaccctgaa gctggtagaa tcaaaactgt taccagcctg
2521 gacctcccag ttctcaggtg gctcaaacta agtgctgagc atggatctct ccacaaagta
2581 tgttttttca tcatttaatt tgtttttcca tgaatcaatt tcatgtcgaa ctatgtgttg
2641 gagaataata gctaactcat tacaatcttc atacagaatg ctatgtttgt gcctcactac
2701 aacctgaatg caaacagtat aatatacgca ttgaagggac gtgcaaggct acaagtagtg
2761 aactgcaatg gcaacaccgt gtttgatgga gagctagaag ccggacgtgc attgacagtg
2821 ccacaaaact atgctgtggc tgcaaagtca ctaagcgaca ggttctcata tgtagcattc
2881 aagaccaatg atagagctgg tattgcaaga cttgcaggga catcatcagt tataaataat
2941 ctgccgttgg atgttgttgc agctacattc aacctgcaga ggaatgaggc aaggcagctc
3001 aagtccaaca atcccttcaa atttctagtt ccagctcgtg agtctgagaa cagagcttcg
3061 gcttagattt ggcaccaaat caatgaaagt aatgaataag aaaactaagg cttagatgcc
3121 tttgttactt gtgtaaaata actcgagtca tgtacctttt agcggaaaca gaataaataa
3181 aaggtaaaat ttcagtgctc tatgcttttc tactccaagt tataaccaga tgatatatat
3241 aacaatcaca ataaataaat gtgagtaaaa aaatattgaa gaaaaatgat gtattgaaat
3301 tataactagc cggattggat ttaaggagtt acaatgaaat tttttga
//