GenBank-Updates@genbank.bio.net (05/27/91)
LOCUS ECOCELOPE 4989 bp ds-DNA BCT 27-MAY-1991 DEFINITION Escherichia coli DNA for cel operon including celA, celB, celC, celD and celF genes ACCESSION X52890 X53290 KEYWORDS PEP dependent phosphotransferase enzyme II-cellobiose; PEP dependent phosphotransferase enzyme III-cellobiose; PEP:sugar phosphotransferase system; celA gene; celB gene; celC gene; celD gene; celF gene; phospho-beta-glucosidase; phosphoenolpyruvate-dependent phosphotransferase system; repressor. SOURCE Escherichia coli DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 4989) AUTHORS Hall,B.G. JOURNAL Unpublished (1990) STANDARD full automatic REFERENCE 2 (bases 1 to 4989) AUTHORS Parker,L.L. and Hall,B.G. TITLE Characterization and Nucleotide Sequence of the Cryptic cel Operon of Escherichia coli K12 JOURNAL Genetics 124, 455-471 (1990) STANDARD full automatic REFERENCE 3 (sites) AUTHORS Parker,L.L. and Hall,B.G. TITLE Mechanisms of activation of the Cryptic cel Operon of Escherichia coli K12 JOURNAL Genetics 124, 473-482 (1990) STANDARD full staff_review COMMENT SWISS-PROT; P17334; PT2C$ECOLI. SWISS-PROT; P17335; PT3C$ECOLI. SWISS-PROT; P17409; CELA$ECOLI. SWISS-PROT; P17410; CELD$ECOLI. SWISS-PROT; P17411; CELF$ECOLI. *source: strain=K12 (MK2). **map: 38.0 minutes (physical map of kohara et al). Data kindly reviewed (02-OCT-1990) by Hall B.G. From EMBL entry ECCELOPE; dated 02-OCT-1990. FEATURES Location/Qualifiers precursor_RNA 238..>237 /note="transcript" RBS 275..279 /note="put. ribosome binding site" CDS 286..603 /note="celA product, unknown" /codon_start=286 RBS 677..681 /note="put. ribosome binding site" CDS 691..1941 /note="celB product, phosphoenolpyruvate dependent phosphotransferase enzyme II-cellobiose" /codon_start=691 RBS 2075..2082 /note="put. ribosome binding site" CDS 2089..2436 /note="celC product, phosphoenolpyruvate dependent phosphotransferase enzyme III-cellobiose" /codon_start=2089 RBS 2438..2443 /note="put. ribosome binding site" CDS 2447..3286 /note="celD product, repressor of the cel operon" /codon_start=2447 RBS 3381..3388 /note="put. ribosome binding site" CDS 3394..4509 /note="celF product, phospho-beta- glucosidase" /codon_start=3394 BASE COUNT 1312 a 1032 c 1260 g 1385 t ORIGIN 1 aagcttgagt aacaacggaa accggccatt gcgccggttt tttttggcct gagttcttaa 61 ttatcttcgc gaattatttg cccgaaatgt gaagagggtc ataaccacag gtcaaggaga 121 aacaatttat aaggtcaaag aaatactatt gctcaggtct ataccgtata ctcctttcag 181 ccacaaaaaa agtcatgttg gtttcgcaag ttacccagag catggaagca ggttaaggct 241 tgcggagtgt ctggctgaca gataatcgtc gatgagggca gttttatgga aaagaaacac 301 atttatctgt tttgttctgc gggcatgtct acctctttac tggtatcaaa aatgcgcgca 361 caggcagaaa aatatgaagt tccggtcatt attgaagcat ttccggaaac actggctggt 421 gaaaaaggtc agaatgccga tgtcgtgtta ttagggccgc agattgctta tatgttgccc 481 gaaatccagc gtttgttacc caacaaaccg gttgaagtaa ttgactcgct gctttatggc 541 aaagtcgatg gtttaggcgt gcttaaggct gcggttgcag cgattaaaaa agccgcagca 601 aattaattta ttttaaattt tcccgtcaaa gagttatttc ataaatcaat accgcaatat 661 ttaaattgcg gtttttaagg gtatttttct atgagtaatg ttattgcatc gcttgaaaag 721 gtactcctcc cttttgcagt taaaatagga aagcagccac acgttaatgc aatcaaaaat 781 ggctttattc gcttaatgcc gttaaccctt gcgggggcca tgtttgtatt aattaacaac 841 gtttttctaa gctttgggga ggggtcgttt ttttattcct taggtattcg cctcgacgcc 901 tcaaccattg aaacacttaa tggtctgaaa ggtattggcg gcaacgtata taacggaaca 961 ttaggaataa tgtctttaat ggcaccgttc tttattggca tggcgctggc agaagagcgt 1021 aaagtcgatg cgctggcggc tgggttgtta tccgttgcag catttatgac cgtcacccca 1081 tatagtgtcg gtgaggccta tgcggttggt gcaaactggt taggtggggc gaatatcatc 1141 tccgggatta ttattggcct ggtggtggca gaaatgttta cctttattgt ccgccgcaat 1201 tgggtcatta aactgcccga cagcgtacct gcttcagtat cgcgttcctt ctcggcattt 1261 aattcccggc tttattattc tttccgtgat ggggattatt gcctggcgtt gaatacctgg 1321 ggcaccaact tccatcagat cattatggat accatctcaa ccccactggc atcgttgggt 1381 agcgtgtggc tggcctatgt gatcttgtcc actgctctgg ttcttcgtat tcatgctgct 1441 tgcgctgacc gcactggaca acggcattat gacgcgtggg cactggaaaa tatcgcgacc 1501 tatcagcaat atggttccgt cgaagcggcg ctggcagccg gtaagacctt ccatatctgg 1561 gccaagccga tgctggactc ctttattttc cttgggggca gtggtgcgac tttaggcctg 1621 atcctggcta tctttatcgc ctctcgccgt gctgattatc gtcaggtggc aaaactggcg 1681 ctgccgtccg gcatcttcca gattaacgaa ccgattctgt ttggtctgcc aattatcatg 1741 aacccggtga tgtttatccc gttgtactgg tacaaccgga ttctggcggc aatcaccctc 1801 gcagcgtact acatgggcat tattcctccg gtgaccaata ttgcaccgtg gaccatgcca 1861 accggtctgg gagccttctt taacaccaac ggtacgtcgc cgcattgctg gtcgcactct 1921 tcaaccttgg catcgcaacg ttaatttatc tgccctttgt tgtggtggct aacaaagcac 1981 aaaatgcgat tgataaagaa gagagcgaag aagatatcgt aacgccctga aattttaatc 2041 gatgccggta cgggtaatcg tgccggtaag aaaagagagg aacgatgtat gatggatctc 2101 gataacattc ccgatacgca aacggaagct gaagagctgg aagaagtggt gatggggctg 2161 atcatcaact ccggacaagc gcgcagcctg gcgtatgcgg cactgaaaca ggcgaacagg 2221 ggcgattttg ccgcagcaaa agccatgatg gatcagtcac ggatggcatt gaatgaagcg 2281 catctggtac agacgaaact gattgaaggc gatgcgggcg aaggtaagat gaaagtgagt 2341 ctggtgctgg tccacgctca ggatcattta atgacgtcca tgcttgcgcg tgaactgatt 2401 actgaattaa ttgagcttca tgaaaaactg aaggcataag gagtcgatga tgcagccagt 2461 gattaacgcg ccggaaattg ccactgcccg agaacagcag ttgtttaatg gcaaaaactt 2521 ccatgtgttt atctataaca aaactgagag tatcagcgga ctgcatcagc acgactatta 2581 tgaatttact ctggtattaa ccgggcgtta tttccaggag attaacggta agcgcgtgtt 2641 actggaacgg ggcgattttg tttttattcc gttaggttcg caccatcaaa gtttttatga 2701 gtttggtgcc acgcgcatat tgaacgttgg gatcagtaaa cgcttttttg agcagcatta 2761 cctgccattg ttgccttatt gctttgtcgc ttcgcaggta taccggacca ataacgcttt 2821 tctcacctat gtggaaacag tgatttcttc attgaatttc cgcgaaacag ggctggaaga 2881 gtttgttgag atggttactt tttatgtcat taaccgttta cgtcattacc gcgaagaaca 2941 ggtgattgat gatgtaccgc agtggctgaa aagtacggta gaaaagatgc atgataaaga 3001 gcagtttagt gaatcggcgc tggagaatat ggtggcgttg tcagccaaat cacaggaata 3061 tttgacgcga gcgactcaac gatattatgg caaaacgcca atgcagatta ttaatgaaat 3121 ccgtattaat tttgccaaaa aacaactgga aatgaccaac tattcagtga cggatattgc 3181 gtttgaggcc ggttatagta gcccgagttt gtttattaaa acgtttaaga aattaacgtc 3241 ctttacgcct aagagctatc gtaagaaatt gactgaattt aatcagtaag tcgttatacc 3301 tgacaattca catatctgtc ctgttgctgg attatttatg tccggggggg cagatatgcc 3361 agatatcagt attctgtact gaagggagaa attatgagcc agaaattaaa agtcgtcact 3421 attggtggcg ggagcagcta taccccggag ttactggaag gatttattaa gcgttatcac 3481 gaattgccgg tcagcgaatt atggctggtg gatgtcgaag gtggtaaacc gaaactggat 3541 attattttcg atctctgcca acggatgatt gataacgctg gcgtcccgat gaagctttat 3601 aaaacgctgg atcgccgcga agcattgaaa gatgctgatt tcgttactac ccaactgcgc 3661 gttggccaat taccggcgcg tgaactggat gaacgtattc cattaagtca tggttatctt 3721 ggtcaggaaa ccaacggcgc gggcggtttg tttaaaggtc tgcgtaccat tccggtgatt 3781 tttgacatcg taaaagatgt cgaagaactt tgtccgaatg catgggtgat taacttcact 3841 aacccggcgg gaatggtcac tgaagccgtt tatcgtcata ccggatttaa acgctttatc 3901 ggcgtgtgta atattccgat cggcatgaag atgtttattc gcgatgttct gatgctgaaa 3961 gacagcgatg atttatctat cgatttgttc ggcctcaacc atatggtgtt cattaaggat 4021 gtgctgataa atggcaagtc gcgctttgcc gaattgcttg atggtgtggc gtcagggcag 4081 ttaaaagcgt cctctgtaaa aaatattttc gatctgccat ttagtgaggg cttaattcgt 4141 tcgttgaatc tgctgccatg ttcttatctg ctgtattact tcaagcagaa agagatgctg 4201 gctattgaaa tgggcgaata ctacaaaggc ggcgcacgag cgcaggtagt acagaaagtc 4261 gagaaacaac tttttgagct gtataaaaat cctgagctga aagttaagcc gaaagaactg 4321 gaacagcgcg gtggggctta ttactctgat gcagcatgcg aagtgatcaa cgctatctac 4381 aacgacaagc aagcagaaca ttacgttaat atcccgcatc atgggcagat tgataatatt 4441 ccggcagact gggcagtaga aatgacctgt aagctggggc gcgatggcgc gacgccacat 4501 ccgcgcaatt aggcatttcg atgataaagt aatggggctg ttcacaccat taaaggcttc 4561 aagattgctg ccagtaacgc cgcaacttaa cggagaattg aacgatatgt tactggcgct 4621 aaaccttagt ccgttggtgc attccgatcg cgatgctgag ctgctggcac gcgagatgat 4681 tctggcgcac gagaaatggc tgccaaactt tgccgactgc atcgcagagc ttaaaaaagc 4741 acattaaccg aggctgatta tggaacgctt actgcttgtt aatgccgatg attttggctt 4801 aagtcaaagg acagaactac ggcattatcg aggcctgtcg caatgggatt gtcactgtcg 4861 acgacgtcac tgtgaatggc aggctattga ccatgcggtg catttgagtt gtgatgaacc 4921 aattctggcc atagggatgc actttgtcct tattatgggt aagccactga cagctatgcc 4981 ggggttaac //