GenBank-Updates@genbank.bio.net (04/30/91)
LOCUS       CLOCELD      2286 bp ds-DNA             BCT       30-APR-1991
DEFINITION  Clostridium thermocellum celD gene for endoglucanase D.
ACCESSION   X04584
KEYWORDS    celD gene; endoglucanase D; inverted repeat.
SOURCE      Clostridium thermocellum.
  ORGANISM  Clostridium thermocellum
            Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and
            cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 2286)
  AUTHORS   Joliff,G., Beguin,P. and Aubert,J.-P.
  TITLE     Nucleotide sequence of the cellulase gene celD encoding
            endoglucanase D of Clostridium thermocellum
  JOURNAL   Nucleic Acids Res. 14, 8605-8613 (1986)
  STANDARD  simple staff_entry
REFERENCE   2  (sites)
  AUTHORS   Mishra,S., Beguin,P. and Aubert,J.-P.
  TITLE     Transcription of Clostridium thermocellum endoglucanase genes
            celF and celD
  JOURNAL   J. Bacteriol. 173, 80-85 (1991)
  STANDARD  full staff_review
COMMENT     [Nucleic Acids Res. 14, 8605-8613 (1986)] enum. -200 to 2086,
            no zero.
            EMBL features not translated to GenBank features:
            key         from    to      description
            RBS          181     188    put. rRNA binding site
            RPT         1953    2024    direct repeat 1
            RPT         2061    2132    direct repeat 1
            RPT         2166    2179    imp. inverted repeat A
            INVREP      2180    2194    imp. inverted repeat A'
            TERM        2166    2194    pot. transcription terminator
                                        (stem-loop structure)
            INVREP      2198    2214    imp. inverted repeat B
            INVREP      2220    2235    imp. inverted repeat B'
            TERM        2198    2235    pot. transcription terminator
                                        (stem-loop structure)
FEATURES             Location/Qualifiers
     CDS             201..2150
                     /note="put. precursor"
                     /codon_start=201
     sig_peptide     201..323
                     /note="put. signal peptide (AA 1-41)"
                     /codon_start=201
     mat_peptide     324..2147
                     /note="put. endoglucanase D (AA 1-608)"
                     /codon_start=324
     CDS             complement(2275..>2286)
                     /note="X gene; carboxy terminus"
                     /codon_start=2286
BASE COUNT      744 a    410 c    513 g    619 t
ORIGIN
        1 aaactaaaac tcctatccaa tactttagtt cagttccagc atacgtctgt attcaaaatg
       61 cctgtattta taactgcatt tataatacct gaagcaaata ataattaaac ttgtggaaga
      121 aaggaggttg caacaggttt taaattatct taattcaggt attttacaat ttttaataaa
      181 aagggggata aaggtaaaaa atgagtagaa tgaccttgaa aagcagcatg aaaaaacgtg
      241 tgttatcttt gctcattgct gtagtgtttc taagcttgac cggagtattt ccttcgggat
      301 tgattgagac caaagtgtca gctgcaaaaa taacggagaa ttatcaattt gattcacgaa
      361 tccgtttaaa ctcaataggt tttataccga accacagcaa aaaggcgact atagctgcaa
      421 attgttcaac cttttatgtt gttaaagaag acggaacaat agtgtatacc ggaacggcaa
      481 cttcaatgtt tgacaatgat acaaaagaaa ctgtttatat tgctgatttt tcatctgtta
      541 atgaagaagg aacgtactat cttgccgtgc cgggagtagg aaaaagcgta aactttaaaa
      601 ttgcaatgaa tgtatatgag gatgctttta aaacagcaat gctgggaatg tatttgctgc
      661 gctgcggcac cagtgtgtcg gccacataca acggaataca ctattcccat ggaccgtgcc
      721 atactaatga tgcatatctt gattatataa acggacagca tactaaaaaa gacagtacaa
      781 aaggctggca tgatgcgggc gactacaaca aatatgtggt aaacgccggc ataaccgttg
      841 gttcaatgtt cctggcgtgg gagcatttta aagaccagtt ggagcctgtg gcattggaga
      901 ttcccgaaaa gaacaattca ataccggatt ttcttgatga attaaaatat gagatagact
      961 ggattcttac catgcaatac cctgacggga gcggaagggt ggctcataaa gtttcgacaa
     1021 ggaactttgg cggctttatc atgcctgaga acgaacacga cgaaagattt ttcgtgccct
     1081 ggagcagtgc cgcaacggca gactttgttg ccatgacggc catggctgca agaatattca
     1141 ggccttatga tcctcaatat gctgaaaaat gtataaatgc ggcaaaagta agctatgagt
     1201 ttttgaagaa caatcctgcg aatgtttttg caaaccagag tggattctca acaggagaat
     1261 atgccactgt cagtgatgca gatgacagat tgtgggcggc ggctgaaatg tgggagaccc
     1321 tgggagatga agaatacctt agagattttg aaaacagggc ggcgcaattc tcgaaaaaaa
     1381 tagaagccga ttttgactgg gataatgttg caaacttagg tatgtttaca tatcttttgt
     1441 cagaaagacc gggcaagaat cctgctttgg tgcagtcaat aaaggatagt ctcctttcca
     1501 ctgcggattc aattgtgagg accagccaaa accatggcta tggcagaacc cttggtacaa
     1561 catattactg gggatgcaac ggcacggttg taagacagac tatgatactt caggttgcga
     1621 acaagatttc acccaacaat gattatgtaa atgctgctct cgatgcgatt tcacatgtat
     1681 ttggaagaaa ctattacaac aggtcttatg taacaggcct tggtataaat cctcctatga
     1741 atcctcatga cagacgttca ggggctgacg gaatatggga gccgtggccc ggttaccttg
     1801 taggaggagg atggcccgga ccgaaggatt gggtggatat tcaggacagt tatcagacca
     1861 atgaaattgc tataaactgg aatgcggcat tgatttatgc ccttgccgga tttgtcaact
     1921 ataattctcc tcaaaatgaa gtactgtacg gagatgtgaa tgatgacgga aaagtaaact
     1981 ccactgactt gactttgtta aaaagatatg ttcttaaagc cgtctcaact ctcccttctt
     2041 ccaaagctga aaagaacgca gatgtaaatc gtgacggaag agttaattcc agtgatgtca
     2101 caatactttc aagatatttg ataagggtaa tcgagaaatt accaatataa attctgataa
     2161 atattgataa acactaatat ataagtgttt aatcggtaaa agagccctgt ggcaaaaact
     2221 gccgcaggct gtttttatca attccggcgc agacgaaaat agcagacgta aatattaatt
     2281 actgaa
//
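
Note on numbering: the COMMENT line says the 1986 paper enumerates the
sequence from -200 to 2086 with no position zero, while this entry
numbers the same bases 1 to 2286. The sketch below shows one way to
convert between the two schemes. It is an illustration only: the
function names are hypothetical and are not part of any GenBank or EMBL
tool, and the mapping assumes the paper's position -200 corresponds to
base 1 of this entry, as the COMMENT implies.

```python
# Hypothetical helpers (names are illustrative, not from any GenBank tool).
# Assumption: paper position -200 is entry base 1, and the paper skips zero.

PAPER_MIN, PAPER_MAX = -200, 2086   # numbering range stated in the COMMENT
ENTRY_LENGTH = 2286                 # length stated on the LOCUS line


def paper_to_entry(pos: int) -> int:
    """Convert a position in the paper's numbering to this entry's numbering."""
    if pos == 0 or not PAPER_MIN <= pos <= PAPER_MAX:
        raise ValueError(f"not a valid paper position: {pos}")
    # Negative positions shift by 201; positive ones by 200 (no zero).
    return pos + 201 if pos < 0 else pos + 200


def entry_to_paper(pos: int) -> int:
    """Convert a position in this entry's numbering back to the paper's."""
    if not 1 <= pos <= ENTRY_LENGTH:
        raise ValueError(f"not a valid entry position: {pos}")
    # Entry bases 1..200 are the paper's -200..-1; the rest are 1..2086.
    return pos - 201 if pos <= 200 else pos - 200


if __name__ == "__main__":
    for p in (-200, -1, 1, 2086):
        print(p, "->", paper_to_entry(p))
```

Under this assumption, paper position 1 maps to entry base 201, which
agrees with the CDS start given in the FEATURES table (201..2150).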