GenBank-Updates@genbank.bio.net (05/18/91)
LOCUS       ECOTYRA      4509 bp ds-DNA             BCT       18-MAY-1991
DEFINITION  E. coli K12 genes pheA, tyrA and aroF for chorismate
            mutase/prephenate dehydratase (EC 5.4.99.5/4.2.1.51) chorismate
            mutase/prephenate dehydrogenase (EC 5.4.99.5/1.3.1.12) and
            DAHP-synthetase (EC 4.1.2.15)
ACCESSION   X02137 M10431
KEYWORDS    attenuator; dehydrogenase; mutase; signal peptide; synthetase;
            unidentified reading frame.
SOURCE      Escherichia coli DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 4509)
  AUTHORS   Hudson,G.S. and Davidson,B.E.
  TITLE     Nucleotide sequence and transcription of the phenylalanine and
            tyrosine operons of Escherichia coli K12
  JOURNAL   J. Mol. Biol. 180, 1023-1051 (1984)
  STANDARD  full automatic
COMMENT     SWISS-PROT; P00888; AROF$ECOLI.
            SWISS-PROT; P07022; PHEA$ECOLI.
            SWISS-PROT; P07023; TYRA$ECOLI.
            SWISS-PROT; P11285; YPHE$ECOLI.
            SWISS-PROT; P11289; YARF$ECOLI.
            From EMBL 26 entry ECTYRA; dated 22-FEB-1991.
FEATURES             Location/Qualifiers
     misc_feature    6..6
                     /note="start of URF1 mRNA"
     RBS             24..27
                     /note="pot. Shine-Dalgarno sequence"
     CDS             36..374
                     /note="URF1 (aa 1-113)"
                     /codon_start=36
     repeat_unit     401..428
                     /note="inverted repeat 1"
     misc_signal     401..420
                     /note="pot. URF1 terminator stem-loop structure"
     repeat_unit     417..420
                     /note="inverted repeat 1'"
     promoter        426..431
                     /note="pot. -35 region (pheA)"
     repeat_unit     440..446
                     /note="inverted repeat 2"
     repeat_unit     441..448
                     /note="inverted repeat 3"
     repeat_unit     449..455
                     /note="inverted repeat 2'"
     promoter        450..455
                     /note="pot. -10 bp region (pheA)"
     repeat_unit     452..458
                     /note="inverted repeat 3'"
     misc_feature    461..461
                     /note="start of pheA mRNA"
     RBS             468..473
                     /note="pot. Shine-Dalgarno sequence"
     sig_peptide     481..525
                     /note="pot. signal peptide (aa -15 to -1)"
     attenuator      501..535
                     /note="pot. attenuator region"
     repeat_unit     501..510
                     /note="inverted repeat 4"
     repeat_unit     527..535
                     /note="inverted repeat 4'"
     misc_feature    575..599
                     /note="pot. pheA attenuator terminator stem-loop
                     structure"
     repeat_unit     575..583
                     /note="inverted repeat 5"
     repeat_unit     590..599
                     /note="inverted repeat 5'"
     RBS             616..619
                     /note="pot. Shine-Dalgarno sequence"
     CDS             627..1784
                     /product="pheA polypeptide"
                     /gene="pheA"
                     /codon_start=627
     misc_feature    complement(1795..1822)
                     /note="pot. tyr. operon terminator stem-loop structure"
     repeat_unit     1795..1806
                     /note="inverted repeat 6"
     repeat_unit     1811..1822
                     /note="inverted repeat 6'"
     CDS             complement(1833..2951)
                     /product="tyrA polypeptide"
                     /gene="tyrA"
                     /codon_start=2951
     RBS             complement(2957..2961)
                     /note="pot. Shine-Dalgarno sequence"
     CDS             complement(2965..4032)
                     /product="aroF polypeptide"
                     /gene="aroF"
                     /codon_start=4032
     RBS             complement(4041..4044)
                     /note="pot. Shine-Dalgarno sequence"
     repeat_region   complement(4069..4074)
                     /note="incomplete inverted repeat 1'"
     repeat_region   complement(4078..4083)
                     /note="incomplete inverted repeat 1"
     misc_feature    4083..4083
                     /note="tyr operon polycistronic transcript start site"
     promoter        complement(4090..4095)
                     /note="pot. -10 bp (tyr operon)"
     promoter        complement(4112..4117)
                     /note="pot. -35 bp region (tyr operon)"
     RBS             4231..4235
                     /note="pot. Shine-Dalgarno sequence"
     CDS             4242..4509
                     /note="pot.
                     URF2"
                     /codon_start=4242
BASE COUNT     1210 a   1177 c   1043 g   1079 t
ORIGIN
        1 aattcaccaa gacgggaaga caagaggtaa aatttatgac aatgaacatt accagcaaac
       61 aaatggaaat tactccggcc atccgccaac atgtcgcaga ccgtctcgcc aaactggaaa
      121 aatggcaaac acatctgatt aatccacata tcattctgtc caaagagcca caagggtttg
      181 ttgctgacgc cacaatcaat acacctaacg gcgttctggt tgccagtggt aaacatgaag
      241 atatgtacac cgcaattaac gaattgatca acaagctgga acggcagctc aataaactgc
      301 agcacaaagg cgaagcacgt cgtgccgcaa catcggtgaa agacgccaac ttcgtcgaag
      361 aagttgaaga agagtagtcc tttatattga gtgtatcgcc aacgcgcctt cgggcgcgtt
      421 ttttgttgac agcgtgaaaa cagtacgggt actgtactaa agtcacttaa ggaaacaaac
      481 atgaaacaca taccgttttt cttcgcattc ttttttacct tcccctgaat gggaggcgtt
      541 tcgtcgtgtg aaacagaatg cgaagacgaa caataaggcc tcccaaatcg gggggccttt
      601 tttattgata acaaaaaggc aacactatga catcggaaaa cccgttactg gcgctgcgag
      661 agaaaatcag cgcgctggat gaaaaattat tagcgttact ggcagaacgg cgcgaactgg
      721 ccgtcgaggt gggaaaagcc aaactgctct cgcatcgccc ggtacgtgat attgatcgtg
      781 aacgcgattt gctggaaaga ttaattacgc tcggtaaagc gcaccatctg gacgcccatt
      841 acattactcg cctgttccag ctcatcattg aagattccgt attaactcag caggctttgc
      901 tccaacaaca tctcaataaa attaatccgc actcagcacg catcgctttt ctcggcccca
      961 aaggttctta ttcccatctt gcggcgcgcc agtatgctgc ccgtcacttt gagcaattca
     1021 ttgaaagtgg ctgcgccaaa tttgccgata tttttaatca ggtggaaacc ggccaggccg
     1081 actatgccgt cgtaccgatt gaaaatacca gctccggtgc cataaacgac gtttacgatc
     1141 tgctgcaaca taccagcttg tcgattgttg gcgagatgac gttaactatc gaccattgtt
     1201 tgttggtctc cggcactact gatttatcca ccatcaatac ggtctacagc catccgcagc
     1261 cattccagca atgcagcaaa ttccttaatc gttatccgca ctggaagatt gaatataccg
     1321 aaagtacgtc tgcggcaatg gaaaaggttg cacaggcaaa atcaccgcat gttgctgcgt
     1381 tgggaagcga agctggcggc actttgtacg gtttgcaggt actggagcgt attgaagcaa
     1441 atcagcgaca aaacttcacc cgatttgtgg tgttggcgcg taaagccatt aacgtgtctg
     1501 atcaggttcc ggcgaaaacc acgttgttaa tggcgaccgg gcaacaagcc ggtgcgctgg
     1561 ttgaagcgtt gctggtactg cgcaaccaca atctgattat gacccgtctg gaatcacgcc
     1621 cgattcacgg taatccatgg gaagagatgt tctatctgga tattcaggcc aatcttgaat
     1681 cagcggaaat gcaaaaagca ttgaaagagt taggggaaat cacccgttca atgaaggtat
     1741 tgggctgtta cccaagtgag aacgtagtgc ctgttgatcc aacctgatga aaaggtgccg
     1801 gatgatgtga atcatccggc actggattat tactggcgat tgtcattcgc ctgacgcaat
     1861 aacacgcggc tttcactctg aaaacgctgt gcgtaatcgc cgaaccagtg ctccaccttg
     1921 cggaaactgt caataaacgc ctgcttatcg ccctgctcca gcaactcaat cgcctcgccg
     1981 aaacgcttat agtaacgttt gattaacgcc agattacgct ctgacgacat aatgatgtcg
     2041 gcataaagct gcggatcctg agcaaacagt cgcccgacca tcgccagctc aaggcggtaa
     2101 atcggcgaag agagcgccag aagttgctca agctgaacat tttcttctgc caggtgcagc
     2161 ccgtaagcaa aagtagcaaa gtggcgcagt gcctgaataa acgccatatt ctgatcgtgc
     2221 tcgacggcgc taatacgatg cagccgagcg ccccagacct gaatttgctc cagaaaccat
     2281 tggtatgctt ccggtttacg tccatcacac cagaccacaa cttgctttgc caggctaccg
     2341 ctgtccggac cgaacatcgg gtgtagcccc agcaccggac catcatgcgc caccagcatg
     2401 gcctgtaatg gcccattttt cactgatgcc agatcgacca gaatacaatc tttcggtaaa
     2461 ggcggtaatt tgccaataac ttgctcagta acgtggattg gcacactaac aatcaccatt
     2521 ccggcatcgg caacaatatc agccgctcga tcccagtcat gttgctccag aatccgcacc
     2581 tgataacccg agagggtcag catcttctcg aacaggcgtc ccatctgacc gccaccgccg
     2641 acgataacca ccggacgcag tgacggacaa agtgttttaa atcctttgtc gttttcactg
     2701 gagtaagatt cacgcatcac ccgacgcaaa acatcctcaa tcagatctgg cggtacaccc
     2761 agagcttccg cctctgcacg acgcgaggcc aacatagatg cctcgcgctc cggaacataa
     2821 ataggcagtc caaagcggct tttcacctcg cccacttcag caaccagttc cagacgcttc
     2881 gctaataaat tcagcagcgc tttatcgact tcatcaattt gatcgcgtaa tgcggtcaat
     2941 tcagcaacca taataaacct cttaagccac gcgagccgtc agctgcccgt tcagatcctg
     3001 atgaatttca cgcagcaagg catcggtcat ttcccagcta atgcaggcat cggttacgga
     3061 tacaccgtat ttcatttcac tgcgcggttg ctcggaagac tgattgccct cgtggatatt
     3121 actttcgatc atcagaccaa taattgagcg attgccatct ttgatttgag caaccacgga
     3181 ttctgccacc gcaggctgac ggcgataatc tttattggaa ttaccgtggc tgcaatctac
     3241 catcagagac gggcgcagtc ccgcctgttc catctctttt tcacattgcg caacatccgc
     3301 agggctatag ttcggcgctt taccaccgcg caggatcaca tggccgtccg gattcccctg
     3361 agtttgtagc aacgcaacct gccctgcctg gttaatgcca acaaaacggt gcggctgggc
     3421 ggcggcgcgc atagcgttaa ttgctgttgc cagactgccg tcggtgccgt ttttaaaacc
     3481 aaccggcatg gaaagcccgg aggccatttc acggtgagtt tgcgattccg ttgtacgagc
     3541 accaattgct gaccagctaa acagatcgcc caggtattgc gggctattcg gatctaacgc
     3601 ttccgtcgcc agtggcagtc ccatattcac cagctcaagc agcaatttac gcgcgatctg
     3661 cagcccggct tctacatcaa aagagccatc catatgggga tcgttaatta accctttcca
     3721 gccgacagtg gtacggggtt tttcaaaata gacgcgcatt accagataga ggctatcgct
     3781 gacctctgcg gcaagggctt taaatcgacg agcatattcc agagcagttt ccggatcatg
     3841 aatggaacaa ggaccacata ctaccagcag acgaggatcg cgcccggcga taatatctga
     3901 aatgctttta cgcgagtcag caatctgggc ttcttgttgc aggctcaatg gaaaagcggc
     3961 cttcagttgt tccggagtca ttaaaacctg ttcgtcggta atatgtacgt tattcagcgc
     4021 gtctttttgc atgatggcga tcctgtttat gctcgtttgc gatagttgat cctcagcgag
     4081 gatgacgtaa cgataacaca taaagtaaag ttttcaatcc atatttcgta catttttatt
     4141 tacacaggca atttagtcgc gctttcaacc cttacctctg tatagataaa tttacactcc
     4201 ctttgaaaac aatccgctat gctttgaaaa aggagaaaga aatgatgaaa aagtttatcg
     4261 cccccttgtt ggctttactg gttagcggat gtcagattga tccttatact cacgcgccaa
     4321 ccttgaccag caccgactgg tatgatgtcg gtatggaaga tgcgatatcg ggcagcgcca
     4381 taaaagatga cgatgcattt agcgattcac aggcggatcg cggtctatac cttaaaggat
     4441 atgccgaagg acaaaagaaa acttgccaga ccgattttac ttatgcccga ggactttccg
     4501 gtaaaagct
//
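
The feature locations in this entry are 1-based and inclusive, and complement(M..N) refers to the reverse complement of bases M through N. The following is a minimal Python sketch of reading numbered ORIGIN lines and pulling out a simple location. The function names parse_origin and extract_location are hypothetical helpers, not part of any GenBank tool, and only the two location forms used in this entry are handled.

```python
import re

# Complement table for lowercase DNA bases.
COMPLEMENT = str.maketrans("acgt", "tgca")


def parse_origin(block):
    """Join numbered ORIGIN lines into one string, dropping the
    position column and the spaces between 10-base groups."""
    chunks = []
    for line in block.splitlines():
        parts = line.split()
        # Sequence lines start with a base position such as 1, 61, 121.
        if parts and parts[0].isdigit():
            chunks.extend(parts[1:])
    return "".join(chunks)


def extract_location(seq, location):
    """Return the bases for 'M..N' or 'complement(M..N)'.
    Positions are 1-based and inclusive, as in the feature table."""
    m = re.fullmatch(r"(complement\()?(\d+)\.\.(\d+)\)?", location)
    if m is None:
        raise ValueError("unsupported location: " + location)
    start, end = int(m.group(2)), int(m.group(3))
    bases = seq[start - 1:end]
    if m.group(1):
        # Reverse complement for features on the opposite strand.
        bases = bases.translate(COMPLEMENT)[::-1]
    return bases


# First sequence line of the entry (bases 1 to 60).
origin_block = """
        1 aattcaccaa gacgggaaga caagaggtaa aatttatgac aatgaacatt accagcaaac
"""
seq = parse_origin(origin_block)

print(len(seq))                          # 60
print(extract_location(seq, "24..27"))   # gagg (RBS 24..27)
print(extract_location(seq, "36..38"))   # atg  (first codon of CDS 36..374)
```

Joined locations, partial locations, and other forms from the feature table definition are outside the scope of this sketch.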