[bionet.molbio.genbank.updates] Clostridium thermocellum celD gene for endoglucanase D.

GenBank-Updates@genbank.bio.net (04/30/91)

LOCUS       CLOCELD      2286 bp ds-DNA             BCT       30-APR-1991
DEFINITION  Clostridium thermocellum celD gene for endoglucanase D.
ACCESSION   X04584
KEYWORDS    celD gene; endoglucanase D; inverted repeat.
SOURCE      Clostridium thermocellum.
  ORGANISM  Clostridium thermocellum
            Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci;
            Bacillaceae.
REFERENCE   1  (bases 1 to 2286)
  AUTHORS   Joliff,G., Beguin,P. and Aubert,J.-P.
  TITLE     Nucleotide sequence of the cellulase gene celD encoding
            endoglucanase D of Clostridium thermocellum
  JOURNAL   Nucleic Acids Res. 14, 8605-8613 (1986)
  STANDARD  simple staff_entry
REFERENCE   2  (sites)
  AUTHORS   Mishra,S., Beguin,P. and Aubert,J.-P.
  TITLE     Transcription of clostridium thermoncellum endoglucanase genes celF
            and celD
  JOURNAL   J. Bacteriol. 173, 80-85 (1991)
  STANDARD  full staff_review
COMMENT
            [Nucleic Acids Res. 14, 8605-8613 (1986)]  enum. -200 to 2086, no
            zero.
            
               EMBL features not translated to GenBank features:
               key        from     to       description
            
               RBS         181    188       put. rRNA binding site
            
               RPT        1953   2024       direct repeat 1
               RPT        2061   2132       direct repeat 1
               RPT        2166   2179       imp. inverted repeat A
               INVREP     2180   2194       imp. inverted repeat A'
               TERM       2166   2194       pot. transcription terminator
                                            (stem-loop structure)
               INVREP     2198   2214       imp. inverted repeat B
               INVREP     2220   2235       imp. inverted repeat B'
               TERM       2198   2235       pot. transcription terminator
                                            (stem-loop structure)
FEATURES             Location/Qualifiers
     CDS             201..2150
                     /note="put. precursor"
                     /codon_start=201
     sig_peptide     201..323
                     /note="put. signal peptide (AA 1-41)"
                     /codon_start=201
     mat_peptide     324..2147
                     /note="put. endoglucanase D (AA 1-608)"
                     /codon_start=324
     CDS             complement(2275..>2286)
                     /note="X gene; carboxy terminus"
                     /codon_start=2286
BASE COUNT      744 a    410 c    513 g    619 t
ORIGIN
        1 aaactaaaac tcctatccaa tactttagtt cagttccagc atacgtctgt attcaaaatg
       61 cctgtattta taactgcatt tataatacct gaagcaaata ataattaaac ttgtggaaga
      121 aaggaggttg caacaggttt taaattatct taattcaggt attttacaat ttttaataaa
      181 aagggggata aaggtaaaaa atgagtagaa tgaccttgaa aagcagcatg aaaaaacgtg
      241 tgttatcttt gctcattgct gtagtgtttc taagcttgac cggagtattt ccttcgggat
      301 tgattgagac caaagtgtca gctgcaaaaa taacggagaa ttatcaattt gattcacgaa
      361 tccgtttaaa ctcaataggt tttataccga accacagcaa aaaggcgact atagctgcaa
      421 attgttcaac cttttatgtt gttaaagaag acggaacaat agtgtatacc ggaacggcaa
      481 cttcaatgtt tgacaatgat acaaaagaaa ctgtttatat tgctgatttt tcatctgtta
      541 atgaagaagg aacgtactat cttgccgtgc cgggagtagg aaaaagcgta aactttaaaa
      601 ttgcaatgaa tgtatatgag gatgctttta aaacagcaat gctgggaatg tatttgctgc
      661 gctgcggcac cagtgtgtcg gccacataca acggaataca ctattcccat ggaccgtgcc
      721 atactaatga tgcatatctt gattatataa acggacagca tactaaaaaa gacagtacaa
      781 aaggctggca tgatgcgggc gactacaaca aatatgtggt aaacgccggc ataaccgttg
      841 gttcaatgtt cctggcgtgg gagcatttta aagaccagtt ggagcctgtg gcattggaga
      901 ttcccgaaaa gaacaattca ataccggatt ttcttgatga attaaaatat gagatagact
      961 ggattcttac catgcaatac cctgacggga gcggaagggt ggctcataaa gtttcgacaa
     1021 ggaactttgg cggctttatc atgcctgaga acgaacacga cgaaagattt ttcgtgccct
     1081 ggagcagtgc cgcaacggca gactttgttg ccatgacggc catggctgca agaatattca
     1141 ggccttatga tcctcaatat gctgaaaaat gtataaatgc ggcaaaagta agctatgagt
     1201 ttttgaagaa caatcctgcg aatgtttttg caaaccagag tggattctca acaggagaat
     1261 atgccactgt cagtgatgca gatgacagat tgtgggcggc ggctgaaatg tgggagaccc
     1321 tgggagatga agaatacctt agagattttg aaaacagggc ggcgcaattc tcgaaaaaaa
     1381 tagaagccga ttttgactgg gataatgttg caaacttagg tatgtttaca tatcttttgt
     1441 cagaaagacc gggcaagaat cctgctttgg tgcagtcaat aaaggatagt ctcctttcca
     1501 ctgcggattc aattgtgagg accagccaaa accatggcta tggcagaacc cttggtacaa
     1561 catattactg gggatgcaac ggcacggttg taagacagac tatgatactt caggttgcga
     1621 acaagatttc acccaacaat gattatgtaa atgctgctct cgatgcgatt tcacatgtat
     1681 ttggaagaaa ctattacaac aggtcttatg taacaggcct tggtataaat cctcctatga
     1741 atcctcatga cagacgttca ggggctgacg gaatatggga gccgtggccc ggttaccttg
     1801 taggaggagg atggcccgga ccgaaggatt gggtggatat tcaggacagt tatcagacca
     1861 atgaaattgc tataaactgg aatgcggcat tgatttatgc ccttgccgga tttgtcaact
     1921 ataattctcc tcaaaatgaa gtactgtacg gagatgtgaa tgatgacgga aaagtaaact
     1981 ccactgactt gactttgtta aaaagatatg ttcttaaagc cgtctcaact ctcccttctt
     2041 ccaaagctga aaagaacgca gatgtaaatc gtgacggaag agttaattcc agtgatgtca
     2101 caatactttc aagatatttg ataagggtaa tcgagaaatt accaatataa attctgataa
     2161 atattgataa acactaatat ataagtgttt aatcggtaaa agagccctgt ggcaaaaact
     2221 gccgcaggct gtttttatca attccggcgc agacgaaaat agcagacgta aatattaatt
     2281 actgaa
//