GenBank-Updates@genbank.bio.net (05/18/91)
LOCUS ECOTYRA 4509 bp ds-DNA BCT 18-MAY-1991
DEFINITION E. coli K12 genes pheA, tyrA and aroF for chorismate
mutase/prephenate dehydratase (EC 5.4.99.5/4.2.1.51) chorismate
mutase/prephenate dehydrogenase (EC 5.4.99.5/1.3.1.12) and
DAHP-synthetase (EC 4.1.2.15)
ACCESSION X02137 M10431
KEYWORDS attenuator; dehydrogenase; mutase; signal peptide; synthetase;
unidentified reading frame.
SOURCE Escherichia coli DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 4509)
AUTHORS Hudson,G.S. and Davidson,B.E.
TITLE Nucleotide sequence and transcription of the phenylalanine and
tyrosine operons of Escherichia coli K12
JOURNAL J. Mol. Biol. 180, 1023-1051 (1984)
STANDARD full automatic
COMMENT SWISS-PROT; P00888; AROF$ECOLI. SWISS-PROT; P07022; PHEA$ECOLI.
SWISS-PROT; P07023; TYRA$ECOLI. SWISS-PROT; P11285; YPHE$ECOLI.
SWISS-PROT; P11289; YARF$ECOLI.
From EMBL 26 entry ECTYRA; dated 22-FEB-1991.
FEATURES Location/Qualifiers
misc_feature 6..6
/note="start of URF1 mRNA"
RBS 24..27
/note="pot. Shine-Dalgarno sequence"
CDS 36..374
/note="URF1 (aa 1-113)"
/codon_start=36
repeat_unit 401..428
/note="inverted repeat 1"
misc_signal 401..420
/note="pot. URF1 terminator stem-loop structure"
repeat_unit 417..420
/note="inverted repeat 1'"
promoter 426..431
/note="pot. -35 region (pheA)"
repeat_unit 440..446
/note="inverted repeat 2"
repeat_unit 441..448
/note="inverted repeat 3"
repeat_unit 449..455
/note="inverted repeat 2'"
promoter 450..455
/note="pot. -10 bp region (pheA)"
repeat_unit 452..458
/note="inverted repeat 3'"
misc_feature 461..461
/note="start of pheA mRNA"
RBS 468..473
/note="pot. Shine-Dalgarno sequence"
sig_peptide 481..525
/note="pot. signal peptide (aa -15 to -1)"
attenuator 501..535
/note="pot. attenuator region"
repeat_unit 501..510
/note="inverted repeat 4"
repeat_unit 527..535
/note="inverted repeat 4'"
misc_feature 575..599
/note="pot. pheA attenuator terminator stem-loop
structure"
repeat_unit 575..583
/note="inverted repeat 5"
repeat_unit 590..599
/note="inverted repeat 5'"
RBS 616..619
/note="pot. Shine-Dalgarno sequence"
CDS 627..1784
/product="pheA polypeptide"
/gene="pheA"
/codon_start=627
misc_feature complement(1795..1822)
/note="pot. tyr. operon terminator stem-loop structure"
repeat_unit 1795..1806
/note="inverted repeat 6"
repeat_unit 1811..1822
/note="inverted repeat 6'"
CDS complement(1833..2951)
/product="tyrA polypeptide"
/gene="tyrA"
/codon_start=2951
RBS complement(2957..2961)
/note="pot. Shine-Dalgarno sequence"
CDS complement(2965..4032)
/product="aroF polypeptide"
/gene="aroF"
/codon_start=4032
RBS complement(4041..4044)
/note="pot. Shine-Dalgarno sequence"
repeat_region complement(4069..4074)
/note="incomplete inverted repeat 1'"
repeat_region complement(4078..4083)
/note="incomplete inverted repeat 1"
misc_feature 4083..4083
/note="tyr operon polycistronic transcript start site"
promoter complement(4090..4095)
/note="pot. -10 bp (tyr operon)"
promoter complement(4112..4117)
/note="pot. -35 bp region (tyr operon)"
RBS 4231..4235
/note="pot. Shine-Dalgarno sequence"
CDS 4242..4509
/note="pot. URF2"
/codon_start=4242
BASE COUNT 1210 a 1177 c 1043 g 1079 t
ORIGIN
1 aattcaccaa gacgggaaga caagaggtaa aatttatgac aatgaacatt accagcaaac
61 aaatggaaat tactccggcc atccgccaac atgtcgcaga ccgtctcgcc aaactggaaa
121 aatggcaaac acatctgatt aatccacata tcattctgtc caaagagcca caagggtttg
181 ttgctgacgc cacaatcaat acacctaacg gcgttctggt tgccagtggt aaacatgaag
241 atatgtacac cgcaattaac gaattgatca acaagctgga acggcagctc aataaactgc
301 agcacaaagg cgaagcacgt cgtgccgcaa catcggtgaa agacgccaac ttcgtcgaag
361 aagttgaaga agagtagtcc tttatattga gtgtatcgcc aacgcgcctt cgggcgcgtt
421 ttttgttgac agcgtgaaaa cagtacgggt actgtactaa agtcacttaa ggaaacaaac
481 atgaaacaca taccgttttt cttcgcattc ttttttacct tcccctgaat gggaggcgtt
541 tcgtcgtgtg aaacagaatg cgaagacgaa caataaggcc tcccaaatcg gggggccttt
601 tttattgata acaaaaaggc aacactatga catcggaaaa cccgttactg gcgctgcgag
661 agaaaatcag cgcgctggat gaaaaattat tagcgttact ggcagaacgg cgcgaactgg
721 ccgtcgaggt gggaaaagcc aaactgctct cgcatcgccc ggtacgtgat attgatcgtg
781 aacgcgattt gctggaaaga ttaattacgc tcggtaaagc gcaccatctg gacgcccatt
841 acattactcg cctgttccag ctcatcattg aagattccgt attaactcag caggctttgc
901 tccaacaaca tctcaataaa attaatccgc actcagcacg catcgctttt ctcggcccca
961 aaggttctta ttcccatctt gcggcgcgcc agtatgctgc ccgtcacttt gagcaattca
1021 ttgaaagtgg ctgcgccaaa tttgccgata tttttaatca ggtggaaacc ggccaggccg
1081 actatgccgt cgtaccgatt gaaaatacca gctccggtgc cataaacgac gtttacgatc
1141 tgctgcaaca taccagcttg tcgattgttg gcgagatgac gttaactatc gaccattgtt
1201 tgttggtctc cggcactact gatttatcca ccatcaatac ggtctacagc catccgcagc
1261 cattccagca atgcagcaaa ttccttaatc gttatccgca ctggaagatt gaatataccg
1321 aaagtacgtc tgcggcaatg gaaaaggttg cacaggcaaa atcaccgcat gttgctgcgt
1381 tgggaagcga agctggcggc actttgtacg gtttgcaggt actggagcgt attgaagcaa
1441 atcagcgaca aaacttcacc cgatttgtgg tgttggcgcg taaagccatt aacgtgtctg
1501 atcaggttcc ggcgaaaacc acgttgttaa tggcgaccgg gcaacaagcc ggtgcgctgg
1561 ttgaagcgtt gctggtactg cgcaaccaca atctgattat gacccgtctg gaatcacgcc
1621 cgattcacgg taatccatgg gaagagatgt tctatctgga tattcaggcc aatcttgaat
1681 cagcggaaat gcaaaaagca ttgaaagagt taggggaaat cacccgttca atgaaggtat
1741 tgggctgtta cccaagtgag aacgtagtgc ctgttgatcc aacctgatga aaaggtgccg
1801 gatgatgtga atcatccggc actggattat tactggcgat tgtcattcgc ctgacgcaat
1861 aacacgcggc tttcactctg aaaacgctgt gcgtaatcgc cgaaccagtg ctccaccttg
1921 cggaaactgt caataaacgc ctgcttatcg ccctgctcca gcaactcaat cgcctcgccg
1981 aaacgcttat agtaacgttt gattaacgcc agattacgct ctgacgacat aatgatgtcg
2041 gcataaagct gcggatcctg agcaaacagt cgcccgacca tcgccagctc aaggcggtaa
2101 atcggcgaag agagcgccag aagttgctca agctgaacat tttcttctgc caggtgcagc
2161 ccgtaagcaa aagtagcaaa gtggcgcagt gcctgaataa acgccatatt ctgatcgtgc
2221 tcgacggcgc taatacgatg cagccgagcg ccccagacct gaatttgctc cagaaaccat
2281 tggtatgctt ccggtttacg tccatcacac cagaccacaa cttgctttgc caggctaccg
2341 ctgtccggac cgaacatcgg gtgtagcccc agcaccggac catcatgcgc caccagcatg
2401 gcctgtaatg gcccattttt cactgatgcc agatcgacca gaatacaatc tttcggtaaa
2461 ggcggtaatt tgccaataac ttgctcagta acgtggattg gcacactaac aatcaccatt
2521 ccggcatcgg caacaatatc agccgctcga tcccagtcat gttgctccag aatccgcacc
2581 tgataacccg agagggtcag catcttctcg aacaggcgtc ccatctgacc gccaccgccg
2641 acgataacca ccggacgcag tgacggacaa agtgttttaa atcctttgtc gttttcactg
2701 gagtaagatt cacgcatcac ccgacgcaaa acatcctcaa tcagatctgg cggtacaccc
2761 agagcttccg cctctgcacg acgcgaggcc aacatagatg cctcgcgctc cggaacataa
2821 ataggcagtc caaagcggct tttcacctcg cccacttcag caaccagttc cagacgcttc
2881 gctaataaat tcagcagcgc tttatcgact tcatcaattt gatcgcgtaa tgcggtcaat
2941 tcagcaacca taataaacct cttaagccac gcgagccgtc agctgcccgt tcagatcctg
3001 atgaatttca cgcagcaagg catcggtcat ttcccagcta atgcaggcat cggttacgga
3061 tacaccgtat ttcatttcac tgcgcggttg ctcggaagac tgattgccct cgtggatatt
3121 actttcgatc atcagaccaa taattgagcg attgccatct ttgatttgag caaccacgga
3181 ttctgccacc gcaggctgac ggcgataatc tttattggaa ttaccgtggc tgcaatctac
3241 catcagagac gggcgcagtc ccgcctgttc catctctttt tcacattgcg caacatccgc
3301 agggctatag ttcggcgctt taccaccgcg caggatcaca tggccgtccg gattcccctg
3361 agtttgtagc aacgcaacct gccctgcctg gttaatgcca acaaaacggt gcggctgggc
3421 ggcggcgcgc atagcgttaa ttgctgttgc cagactgccg tcggtgccgt ttttaaaacc
3481 aaccggcatg gaaagcccgg aggccatttc acggtgagtt tgcgattccg ttgtacgagc
3541 accaattgct gaccagctaa acagatcgcc caggtattgc gggctattcg gatctaacgc
3601 ttccgtcgcc agtggcagtc ccatattcac cagctcaagc agcaatttac gcgcgatctg
3661 cagcccggct tctacatcaa aagagccatc catatgggga tcgttaatta accctttcca
3721 gccgacagtg gtacggggtt tttcaaaata gacgcgcatt accagataga ggctatcgct
3781 gacctctgcg gcaagggctt taaatcgacg agcatattcc agagcagttt ccggatcatg
3841 aatggaacaa ggaccacata ctaccagcag acgaggatcg cgcccggcga taatatctga
3901 aatgctttta cgcgagtcag caatctgggc ttcttgttgc aggctcaatg gaaaagcggc
3961 cttcagttgt tccggagtca ttaaaacctg ttcgtcggta atatgtacgt tattcagcgc
4021 gtctttttgc atgatggcga tcctgtttat gctcgtttgc gatagttgat cctcagcgag
4081 gatgacgtaa cgataacaca taaagtaaag ttttcaatcc atatttcgta catttttatt
4141 tacacaggca atttagtcgc gctttcaacc cttacctctg tatagataaa tttacactcc
4201 ctttgaaaac aatccgctat gctttgaaaa aggagaaaga aatgatgaaa aagtttatcg
4261 cccccttgtt ggctttactg gttagcggat gtcagattga tccttatact cacgcgccaa
4321 ccttgaccag caccgactgg tatgatgtcg gtatggaaga tgcgatatcg ggcagcgcca
4381 taaaagatga cgatgcattt agcgattcac aggcggatcg cggtctatac cttaaaggat
4441 atgccgaagg acaaaagaaa acttgccaga ccgattttac ttatgcccga ggactttccg
4501 gtaaaagct
//