GenBank-Updates@genbank.bio.net (09/14/90)
LOCUS CLOCELH 3002 bp ds-DNA BCT 14-SEP-1990 DEFINITION C.thermocellum cellulase (celH) gene, complete cds. ACCESSION M31903 KEYWORDS cellulase; endoglucanase. SOURCE C.thermocellum (strain NCIB 10682) DNA. ORGANISM Clostridium thermocellum Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 3002) AUTHORS Yaguee,E., Beguin,P. and Aubert,J.-P. TITLE Nucleotide sequence and deletion analysis of the cellulase- encoding gene celH of Clostridium thermocellum JOURNAL Gene 89, 61-67 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P. Beguin, 07-FEB-1990. FEATURES from to/span description pept 171 2873 cellulase precursor (EC 3.2.1.4) sigp 171 302 cellulase signal peptide matp 303 2870 cellulase binding 159 164 ribosome binding site BASE COUNT 1003 a 572 c 628 g 799 t ORIGIN 1 cagaaagtat taattagtag aggggaaaat tatttaaacc caaaatttaa aatgccattt 61 ttgacaaaat accattggga aaggaggata tactttaaca accggcattt aagaacaatt 121 taaattaatt aaaattttgc tttttaaagt tttctaaagg gagggacatt atgaaaaaaa 181 ggcttttagt ttcttttttg gtgttaagca taattgtagg attactttct tttcagtcgc 241 ttggtaatta caacagtggt ttaaaaatcg gtgcttgggt gggaacccag ccgtcagaat 301 cagcaattaa gagttttcag gaacttcagg gtagaaagct tgatattgtc caccagttta 361 ttaactggtc aactgatttt tcctgggtaa gaccttatgc cgacgctgtt tataataacg 421 gctcaatatt aatgattacc tgggaacctt gggaatacaa cactgtagat atcaaaaacg 481 gtaaagcgga tgcttacata accagaatgg cgcaagatat gaaagcctat ggcaaggaaa 541 tttggttaag acctcttcat gaagccaacg gagactggta tccatgggcc ataggatatt 601 cttcaagagt aaacacaaac gaaacttaca tagccgcttt cagacatatt gtcgatattt 661 tccgtgccaa cggagccacc aacgtcaaat gggtgtttaa tgtaaactgc gacaatgtag 721 gtaacggcac aagttatctg ggtcattatc ccggagataa ttatgtagac tacacctcaa 781 ttgacggata caactggggt accactcaaa gctggggaag ccaatggcaa agctttgatc 841 aggttttctc cagagcctac caagctttgg catcaataaa caaacccatc attatagcag 901 agtttgcatc agctgaaata ggcggaaaca aggcaagatg gattacagaa gcatataact 961 ctataagaac atcctacaac aaggtaattg ctgcagtatg gtttcacgag aacaaagaaa 1021 ccgactggag aatcaactca agtcctgaag cccttgcagc atacagggag gcaataggag 1081 ccggttcatc aaatcctacc cctactccaa cttggacctc tactccacca tcaagctcac 1141 caaaggctgt cgaccccttt gaaatggtta gaaaaatggg tatgggaaca aacctcggaa 1201 acactctcga agctccctat gaaggctcct ggtccaagtc tgccatggaa tattattttg 1261 atgattttaa agctgcagga tataaaaacg taagaatccc tgtaagatgg gacaaccata 1321 caatgaggac atacccgtat accattgaca aagccttttt ggacagggtt gagcaagtgg 1381 ttgactggtc actttcaaga ggttttgtta caattataaa ttctcaccat gatgactgga 1441 tcaaggaaga ctataacgga aacatagaac ggtttgaaaa gatatgggaa cagattgcgg 1501 aaaggtttaa aaacaaatcc gaaaatcttc tgtttgaaat catgaatgag cctttcggta 1561 acattacaga cgaacaaata gacgacatga acagcagaat attaaaaata atcagaaaga 1621 ccaatccaac ccgtattgtt ataataggcg gaggttattg gaacagttat aatacgcttg 1681 taaacattaa aattcctgat gacccatact taatcggaac tttccattac tatgacccat 1741 atgaatttac tcacaagtgg agaggtacat ggggtactca ggaagacatg gatactgtag 1801 taagagtatt tgattttgtt aagagttggt ctgacagaaa caatatcccg gtatattttg 1861 gagaatttgc cgtaatggct tatgccgaca gaacttcccg tgtaaaatgg tatgatttta 1921 taagtgatgc ggccctggag cgcggttttg catgttccgt atgggataac ggcgtttttg 1981 gttcattgga taatgacatg gctatttaca acagagatac ccgtaccttt gacactgaaa 2041 tcctcaatgc actatttaat cccggaacat atccgtctta ttctccgaaa ccttcaccaa 2101 ctccaagacc gaccaaaccg cccgtaacac cggctgtcgg tgaaaaaatg ctggatgatt 2161 ttgagggtgt gttaaattgg ggttcatact ccggtgaagg tgcaaaagtt tcaacaaaaa 2221 ttgtgtccgg aaaaacagga aacggcatgg aagtcagcta caccgggaca acggacggct 2281 actggggaac agtatacagt ttaccggacg gcgattggtc aaaatggctt aaaatctctt 2341 ttgacattaa gtccgttgac ggttctgcca atgaaatcag atttatgatt gctgaaaaaa 2401 gcataaacgg tgtgggagac ggagaacact gggtttactc aataactccc gacagttcgt 2461 ggaaaactat agaaataccg ttctccagct ttagaagaag acttgattat cagccgcctg 2521 gacaggatat gagcggtact ttggatcttg acaatataga ttcaattcac ttcatgtatg 2581 ccaacaacaa gtcgggaaaa tttgtcgtag acaatatcaa gctgattggt gctacttccg 2641 atccgactcc ttcaataaaa cacggagatt tgaacttcga taatgcagtg aattctacag 2701 acttgttaat gcttaaaagg tatatcctca aatctttgga actcggtaca tctgagcagg 2761 aggaaaaatt caaaaaagcg gcagatttaa acagggacaa caaggtcgac tccactgact 2821 tgacaatttt gaaaagatac ttgctgaaag ccatcagtga aatacccata taaatttcag 2881 gcataaattt tcaggcaaaa tttaatttat tataaaatac tgtgggcatg ctgcaaatag 2941 gattagaatc accaccctca aaaatcctgt aaaaagcatg cccacaataa attttatttc 3001 at //