GenBank-Updates@genbank.bio.net (04/30/91)
LOCUS CLOCELD 2286 bp ds-DNA BCT 30-APR-1991
DEFINITION Clostridium thermocellum celD gene for endoglucanase D.
ACCESSION X04584
KEYWORDS celD gene; endoglucanase D; inverted repeat.
SOURCE Clostridium thermocellum.
ORGANISM Clostridium thermocellum
Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci;
Bacillaceae.
REFERENCE 1 (bases 1 to 2286)
AUTHORS Joliff,G., Beguin,P. and Aubert,J.-P.
TITLE Nucleotide sequence of the cellulase gene celD encoding
endoglucanase D of Clostridium thermocellum
JOURNAL Nucleic Acids Res. 14, 8605-8613 (1986)
STANDARD simple staff_entry
REFERENCE 2 (sites)
AUTHORS Mishra,S., Beguin,P. and Aubert,J.-P.
TITLE Transcription of clostridium thermoncellum endoglucanase genes celF
and celD
JOURNAL J. Bacteriol. 173, 80-85 (1991)
STANDARD full staff_review
COMMENT
[Nucleic Acids Res. 14, 8605-8613 (1986)] enum. -200 to 2086, no
zero.
EMBL features not translated to GenBank features:
key from to description
RBS 181 188 put. rRNA binding site
RPT 1953 2024 direct repeat 1
RPT 2061 2132 direct repeat 1
RPT 2166 2179 imp. inverted repeat A
INVREP 2180 2194 imp. inverted repeat A'
TERM 2166 2194 pot. transcription terminator
(stem-loop structure)
INVREP 2198 2214 imp. inverted repeat B
INVREP 2220 2235 imp. inverted repeat B'
TERM 2198 2235 pot. transcription terminator
(stem-loop structure)
FEATURES Location/Qualifiers
CDS 201..2150
/note="put. precursor"
/codon_start=201
sig_peptide 201..323
/note="put. signal peptide (AA 1-41)"
/codon_start=201
mat_peptide 324..2147
/note="put. endoglucanase D (AA 1-608)"
/codon_start=324
CDS complement(2275..>2286)
/note="X gene; carboxy terminus"
/codon_start=2286
BASE COUNT 744 a 410 c 513 g 619 t
ORIGIN
1 aaactaaaac tcctatccaa tactttagtt cagttccagc atacgtctgt attcaaaatg
61 cctgtattta taactgcatt tataatacct gaagcaaata ataattaaac ttgtggaaga
121 aaggaggttg caacaggttt taaattatct taattcaggt attttacaat ttttaataaa
181 aagggggata aaggtaaaaa atgagtagaa tgaccttgaa aagcagcatg aaaaaacgtg
241 tgttatcttt gctcattgct gtagtgtttc taagcttgac cggagtattt ccttcgggat
301 tgattgagac caaagtgtca gctgcaaaaa taacggagaa ttatcaattt gattcacgaa
361 tccgtttaaa ctcaataggt tttataccga accacagcaa aaaggcgact atagctgcaa
421 attgttcaac cttttatgtt gttaaagaag acggaacaat agtgtatacc ggaacggcaa
481 cttcaatgtt tgacaatgat acaaaagaaa ctgtttatat tgctgatttt tcatctgtta
541 atgaagaagg aacgtactat cttgccgtgc cgggagtagg aaaaagcgta aactttaaaa
601 ttgcaatgaa tgtatatgag gatgctttta aaacagcaat gctgggaatg tatttgctgc
661 gctgcggcac cagtgtgtcg gccacataca acggaataca ctattcccat ggaccgtgcc
721 atactaatga tgcatatctt gattatataa acggacagca tactaaaaaa gacagtacaa
781 aaggctggca tgatgcgggc gactacaaca aatatgtggt aaacgccggc ataaccgttg
841 gttcaatgtt cctggcgtgg gagcatttta aagaccagtt ggagcctgtg gcattggaga
901 ttcccgaaaa gaacaattca ataccggatt ttcttgatga attaaaatat gagatagact
961 ggattcttac catgcaatac cctgacggga gcggaagggt ggctcataaa gtttcgacaa
1021 ggaactttgg cggctttatc atgcctgaga acgaacacga cgaaagattt ttcgtgccct
1081 ggagcagtgc cgcaacggca gactttgttg ccatgacggc catggctgca agaatattca
1141 ggccttatga tcctcaatat gctgaaaaat gtataaatgc ggcaaaagta agctatgagt
1201 ttttgaagaa caatcctgcg aatgtttttg caaaccagag tggattctca acaggagaat
1261 atgccactgt cagtgatgca gatgacagat tgtgggcggc ggctgaaatg tgggagaccc
1321 tgggagatga agaatacctt agagattttg aaaacagggc ggcgcaattc tcgaaaaaaa
1381 tagaagccga ttttgactgg gataatgttg caaacttagg tatgtttaca tatcttttgt
1441 cagaaagacc gggcaagaat cctgctttgg tgcagtcaat aaaggatagt ctcctttcca
1501 ctgcggattc aattgtgagg accagccaaa accatggcta tggcagaacc cttggtacaa
1561 catattactg gggatgcaac ggcacggttg taagacagac tatgatactt caggttgcga
1621 acaagatttc acccaacaat gattatgtaa atgctgctct cgatgcgatt tcacatgtat
1681 ttggaagaaa ctattacaac aggtcttatg taacaggcct tggtataaat cctcctatga
1741 atcctcatga cagacgttca ggggctgacg gaatatggga gccgtggccc ggttaccttg
1801 taggaggagg atggcccgga ccgaaggatt gggtggatat tcaggacagt tatcagacca
1861 atgaaattgc tataaactgg aatgcggcat tgatttatgc ccttgccgga tttgtcaact
1921 ataattctcc tcaaaatgaa gtactgtacg gagatgtgaa tgatgacgga aaagtaaact
1981 ccactgactt gactttgtta aaaagatatg ttcttaaagc cgtctcaact ctcccttctt
2041 ccaaagctga aaagaacgca gatgtaaatc gtgacggaag agttaattcc agtgatgtca
2101 caatactttc aagatatttg ataagggtaa tcgagaaatt accaatataa attctgataa
2161 atattgataa acactaatat ataagtgttt aatcggtaaa agagccctgt ggcaaaaact
2221 gccgcaggct gtttttatca attccggcgc agacgaaaat agcagacgta aatattaatt
2281 actgaa
//