GenBank-Updates@genbank.bio.net (09/14/90)
LOCUS CLOCELH 3002 bp ds-DNA BCT 14-SEP-1990
DEFINITION C.thermocellum cellulase (celH) gene, complete cds.
ACCESSION M31903
KEYWORDS cellulase; endoglucanase.
SOURCE C.thermocellum (strain NCIB 10682) DNA.
ORGANISM Clostridium thermocellum
Prokaryota; Bacteria; Firmicutes;
Endospore-forming rods and cocci; Bacillaceae.
REFERENCE 1 (bases 1 to 3002)
AUTHORS Yaguee,E., Beguin,P. and Aubert,J.-P.
TITLE Nucleotide sequence and deletion analysis of the cellulase-
encoding gene celH of Clostridium thermocellum
JOURNAL Gene 89, 61-67 (1990)
STANDARD full staff_entry
COMMENT Draft entry and computer-readable sequence for [1] kindly submitted
by P. Beguin, 07-FEB-1990.
FEATURES from to/span description
pept 171 2873 cellulase precursor (EC 3.2.1.4)
sigp 171 302 cellulase signal peptide
matp 303 2870 cellulase
binding 159 164 ribosome binding site
BASE COUNT 1003 a 572 c 628 g 799 t
ORIGIN
1 cagaaagtat taattagtag aggggaaaat tatttaaacc caaaatttaa aatgccattt
61 ttgacaaaat accattggga aaggaggata tactttaaca accggcattt aagaacaatt
121 taaattaatt aaaattttgc tttttaaagt tttctaaagg gagggacatt atgaaaaaaa
181 ggcttttagt ttcttttttg gtgttaagca taattgtagg attactttct tttcagtcgc
241 ttggtaatta caacagtggt ttaaaaatcg gtgcttgggt gggaacccag ccgtcagaat
301 cagcaattaa gagttttcag gaacttcagg gtagaaagct tgatattgtc caccagttta
361 ttaactggtc aactgatttt tcctgggtaa gaccttatgc cgacgctgtt tataataacg
421 gctcaatatt aatgattacc tgggaacctt gggaatacaa cactgtagat atcaaaaacg
481 gtaaagcgga tgcttacata accagaatgg cgcaagatat gaaagcctat ggcaaggaaa
541 tttggttaag acctcttcat gaagccaacg gagactggta tccatgggcc ataggatatt
601 cttcaagagt aaacacaaac gaaacttaca tagccgcttt cagacatatt gtcgatattt
661 tccgtgccaa cggagccacc aacgtcaaat gggtgtttaa tgtaaactgc gacaatgtag
721 gtaacggcac aagttatctg ggtcattatc ccggagataa ttatgtagac tacacctcaa
781 ttgacggata caactggggt accactcaaa gctggggaag ccaatggcaa agctttgatc
841 aggttttctc cagagcctac caagctttgg catcaataaa caaacccatc attatagcag
901 agtttgcatc agctgaaata ggcggaaaca aggcaagatg gattacagaa gcatataact
961 ctataagaac atcctacaac aaggtaattg ctgcagtatg gtttcacgag aacaaagaaa
1021 ccgactggag aatcaactca agtcctgaag cccttgcagc atacagggag gcaataggag
1081 ccggttcatc aaatcctacc cctactccaa cttggacctc tactccacca tcaagctcac
1141 caaaggctgt cgaccccttt gaaatggtta gaaaaatggg tatgggaaca aacctcggaa
1201 acactctcga agctccctat gaaggctcct ggtccaagtc tgccatggaa tattattttg
1261 atgattttaa agctgcagga tataaaaacg taagaatccc tgtaagatgg gacaaccata
1321 caatgaggac atacccgtat accattgaca aagccttttt ggacagggtt gagcaagtgg
1381 ttgactggtc actttcaaga ggttttgtta caattataaa ttctcaccat gatgactgga
1441 tcaaggaaga ctataacgga aacatagaac ggtttgaaaa gatatgggaa cagattgcgg
1501 aaaggtttaa aaacaaatcc gaaaatcttc tgtttgaaat catgaatgag cctttcggta
1561 acattacaga cgaacaaata gacgacatga acagcagaat attaaaaata atcagaaaga
1621 ccaatccaac ccgtattgtt ataataggcg gaggttattg gaacagttat aatacgcttg
1681 taaacattaa aattcctgat gacccatact taatcggaac tttccattac tatgacccat
1741 atgaatttac tcacaagtgg agaggtacat ggggtactca ggaagacatg gatactgtag
1801 taagagtatt tgattttgtt aagagttggt ctgacagaaa caatatcccg gtatattttg
1861 gagaatttgc cgtaatggct tatgccgaca gaacttcccg tgtaaaatgg tatgatttta
1921 taagtgatgc ggccctggag cgcggttttg catgttccgt atgggataac ggcgtttttg
1981 gttcattgga taatgacatg gctatttaca acagagatac ccgtaccttt gacactgaaa
2041 tcctcaatgc actatttaat cccggaacat atccgtctta ttctccgaaa ccttcaccaa
2101 ctccaagacc gaccaaaccg cccgtaacac cggctgtcgg tgaaaaaatg ctggatgatt
2161 ttgagggtgt gttaaattgg ggttcatact ccggtgaagg tgcaaaagtt tcaacaaaaa
2221 ttgtgtccgg aaaaacagga aacggcatgg aagtcagcta caccgggaca acggacggct
2281 actggggaac agtatacagt ttaccggacg gcgattggtc aaaatggctt aaaatctctt
2341 ttgacattaa gtccgttgac ggttctgcca atgaaatcag atttatgatt gctgaaaaaa
2401 gcataaacgg tgtgggagac ggagaacact gggtttactc aataactccc gacagttcgt
2461 ggaaaactat agaaataccg ttctccagct ttagaagaag acttgattat cagccgcctg
2521 gacaggatat gagcggtact ttggatcttg acaatataga ttcaattcac ttcatgtatg
2581 ccaacaacaa gtcgggaaaa tttgtcgtag acaatatcaa gctgattggt gctacttccg
2641 atccgactcc ttcaataaaa cacggagatt tgaacttcga taatgcagtg aattctacag
2701 acttgttaat gcttaaaagg tatatcctca aatctttgga actcggtaca tctgagcagg
2761 aggaaaaatt caaaaaagcg gcagatttaa acagggacaa caaggtcgac tccactgact
2821 tgacaatttt gaaaagatac ttgctgaaag ccatcagtga aatacccata taaatttcag
2881 gcataaattt tcaggcaaaa tttaatttat tataaaatac tgtgggcatg ctgcaaatag
2941 gattagaatc accaccctca aaaatcctgt aaaaagcatg cccacaataa attttatttc
3001 at
//