GenBank-Updates@genbank.bio.net (10/23/90)
LOCUS       TRRCBHIIQ    2774 bp ds-DNA             PLN       23-OCT-1990
DEFINITION  Trichoderma reesei cellobiohydrolase II gene, complete cds.
ACCESSION   M55080
KEYWORDS    cellobiohydrolase II.
SOURCE      Trichoderma reesei (strain QM9414) DNA.
  ORGANISM  Trichoderma reesei
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
            Pyrenomycetes; Hypocreales; Hypocreaceae.
REFERENCE   1  (bases 1 to 2774)
  AUTHORS   Chen,C.M., Gritzali,M. and Stafford,D.W.
  TITLE     Nucleotide sequence and deduced primary structure of
            cellobiohydrolase II from Trichoderma reesei
  JOURNAL   Bio/Technology 5, 274-278 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span    description
    pept        613     704     cellobiohydrolase II, exon 1
                754    1140     cellobiohydrolase II, exon 2
               1197    1444     cellobiohydrolase II, exon 3
               1535    2223     cellobiohydrolase II, exon 4
    sigp        613     684     cellobiohydrolase II signal peptide
    matp        685     704     cellobiohydrolase II
                754    1140     cellobiohydrolase II
               1197    1444     cellobiohydrolase II
               1535    2220     cellobiohydrolase II
    IVS         705     753     cellobiohydrolase II intron
    IVS        1141    1196     cellobiohydrolase II intron
    IVS        1445    1534     cellobiohydrolase II intron
BASE COUNT      647 a    781 c    663 g    683 t
ORIGIN
        1 gaattctagg ctaggtatgc gaggcacgcg gatctagggc agactgggca ttgcatagct
       61 atggtgtagt agaactcccg tcaacggcta ttctcaccta gactttcccc ttcgaactga
      121 caagttgtta tattgcctgt gtaccaagcg ctaatgtgga caggattaat gccagagttc
      181 attagcctca agtagagcct atttcctcgc cggaaagtca tctctcttat tgcatttctg
      241 ccttccacta actcagggtg cagcgcaaca ctacacgcaa catatcacat ttattagccg
      301 tgcaacaagg ctattctacg aaaaatgcta cactccacat gttaaaggcg cattcaacca
      361 gcttctttat tgggtccata cagccaggcg gggatcaagc tcattagccg ccactcaagg
      421 ctatacaatg ttgccaactc tccgggcttt atcctgtgct cccgaatacc acatcgtgat
      481 gatgcttcag cgcacggaag tcacagacac cgcctgtata aaagggggac tgtgaccctg
      541 tatgaggcgc aacatggtct cacagcagct cacctgaaga ggcttgtaag atcaccctct
      601 gtgtattgca ccatgattgt cggcattctc accacgctgg ctacgctggc cacactcgca
      661 gctagtgtgc ctctagagga gcggcaagct tgctcaagcg tctggtaatt atgtgaaccc
      721 tctcaagaga cccaaatact gagatatgtc aaggggccaa tgtggtggcc agaattggtc
      781 gggtccgact tgctgtgctt ccggaagcac atgcgtctac tccaacgact attactccca
      841 gtgtcttccc ggcgctgcaa gctcaagctc gtccacgcgc gccgcgtcga cgacttctcg
      901 agtatccccc acaacatccc ggtcgagctc cgcgacgcct ccacctggtt ctactactac
      961 cagagtacct ccagtcggat cgggaaccgc tacgtattca ggcaaccctt ttgttggggt
     1021 cactccttgg gccaatgcat attacgcctc tgaagttagc agcctcgcta ttcctagctt
     1081 gactggagcc atggccactg ctgcagcagc tgtcgcaaag gttccctctt ttatgtggct
     1141 gtaggtcctc ccggaaccaa ggcaatctgt tactgaaggc tcatcattca ctgcagagat
     1201 actcttgaca agacccctct catggagcaa accttggccg acatccgcac cgccaacaag
     1261 aatggcggta actatgcggg acagtttgtg gtgtatgact tgccggatcg cgattgcgct
     1321 gcccttgcct cgaatggcga atactctatt gccgatggtg gcgtcgccaa atataagaac
     1381 tatatcgaca ccattcgtca aattgtcgtg gaatattccg atatccggac cctcctggtt
     1441 attggtatga gtttaaacac ctgcctcccc ccccccttcc cttcctttcc cgccggcatc
     1501 ttgtcgttgt gctaactatt gttccctctt ccagagcctg actctcttgc caacctggtg
     1561 accaacctcg gtactccaaa gtgtgccaat gctcagtcag cctaccttga gtgcatcaac
     1621 tacgccgtca cacagctgaa ccttccaaat gttgcgatgt atttggacgc tggccatgca
     1681 ggatggcttg gctggccggc aaaccaagac ccggccgctc agctatttgc aaatgtttac
     1741 aagaatgcat cgtctccgag agctcttcgc ggattggcaa ccaatgtcgc caactacaac
     1801 gggtggaaca ttaccagccc cccatcgtac acgcaaggca acgctgtcta caacgagaag
     1861 ctgtacatcc acgctattgg acgtcttctt gccaatcacg gctggtccaa cgccttcttc
     1921 atcactgatc aaggtcgatc gggaaagcag cctaccggac agcaacagtg gggagactgg
     1981 tgcaatgtga tcggcaccgg atttggtatt cgcccatccg caaacactgg ggactcgttg
     2041 ctggattcgt ttgtctgggt caagccaggc ggcgagtgtg acggcaccag cgacagcagt
     2101 gcgccacgat ttgactccca ctgtgcgctc ccagatgcct tgcaaccggc ggctcaagct
     2161 ggtgcttggt tccaagccta ctttgtgcag cttctcacaa acgcaaaccc atcgttcctg
     2221 taaggctttc gtgaccgggc ttcaaacaat gatgtgcgat ggtgtggttc ccggttggcg
     2281 gagtctttgt ctactttggt tgtctgtcgc aggtcggtag accgcaaatg agcaactgat
     2341 ggattgttgc cagcatacta taattcacat ggatggtctt tgtcgatcag tagctagtga
     2401 gagagagaga acatctatcc acaatgtcga gtgtctatta gacatactcc gagaataaag
     2461 tcaacctttc tgtgatctaa agatcgattc ggcagtcgag tagcgtataa caactccgag
     2521 taccagcaaa agcacgtcgt gacaggagcg gctttgccaa ctgcgcaacc ttgcttgaat
     2581 gaggatacac ggggtgcaac atggctgtac tgatccatcg caaccaaaat ttctgtttat
     2641 agatcaagct ggtagattcc aattactcca cctcttgcgc ttctccatga catgtaagtg
     2701 cacgtggaaa ccatacccaa attgcctaca gctgcggagc atgagcctat ggcgatcagt
     2761 ctggtcatgt taac
//
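
The coordinates in the FEATURES table of the record above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the GenBank entry: the span lists are copied by hand from the table, and the helper names (span_length, introns_between) are hypothetical. It checks that the pept exons add up to a whole number of codons, that the exon boundaries agree with the IVS entries, and that BASE COUNT sums to the locus length.

```python
# Spans copied by hand from the FEATURES table (1-based, inclusive coordinates).
PEPT = [(613, 704), (754, 1140), (1197, 1444), (1535, 2223)]
SIGP = [(613, 684)]
MATP = [(685, 704), (754, 1140), (1197, 1444), (1535, 2220)]
IVS = [(705, 753), (1141, 1196), (1445, 1534)]
BASE_COUNT = {"a": 647, "c": 781, "g": 663, "t": 683}
LOCUS_LENGTH = 2774


def span_length(spans):
    """Total number of bases covered by a list of inclusive (start, end) spans."""
    return sum(end - start + 1 for start, end in spans)


def introns_between(exons):
    """Gaps between consecutive exons, as inclusive (start, end) spans."""
    return [
        (prev_end + 1, next_start - 1)
        for (_, prev_end), (next_start, _) in zip(exons, exons[1:])
    ]


coding_bp = span_length(PEPT)   # includes the stop codon at 2221..2223
signal_bp = span_length(SIGP)
mature_bp = span_length(MATP)   # stops at 2220, so the stop codon is excluded

print("pept:", coding_bp, "bp =", coding_bp // 3, "codons (incl. stop)")
print("sigp:", signal_bp, "bp =", signal_bp // 3, "codons")
print("matp:", mature_bp, "bp =", mature_bp // 3, "codons")
print("introns implied by exons:", introns_between(PEPT))
print("base count total:", sum(BASE_COUNT.values()))
```

Under these assumptions the pept spans total 1416 bp (472 codons including the stop), split into 72 bp of signal peptide, 1341 bp of mature peptide and the 3 bp stop codon; the gaps between exons match the three IVS lines, and the base counts sum to 2774.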
GenBank-Updates@genbank.bio.net (10/24/90)
LOCUS       TRRCBHIIQ    2774 bp ds-DNA             PLN       24-OCT-1990
DEFINITION  Trichoderma reesei cellobiohydrolase II gene, complete cds.
ACCESSION   M55080
KEYWORDS    cellobiohydrolase II.
SOURCE      Trichoderma reesei (strain QM9414) DNA.
  ORGANISM  Trichoderma reesei
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
            Pyrenomycetes; Hypocreales; Hypocreaceae.
REFERENCE   1  (bases 1 to 2774)
  AUTHORS   Chen,C.M., Gritzali,M. and Stafford,D.W.
  TITLE     Nucleotide sequence and deduced primary structure of
            cellobiohydrolase II from Trichoderma reesei
  JOURNAL   Bio/Technology 5, 274-278 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span    description
    pept        613     704     cellobiohydrolase II, exon 1
                754    1140     cellobiohydrolase II, exon 2
               1197    1444     cellobiohydrolase II, exon 3
               1535    2223     cellobiohydrolase II, exon 4
    sigp        613     684     cellobiohydrolase II signal peptide
    matp        685     704     cellobiohydrolase II
                754    1140     cellobiohydrolase II
               1197    1444     cellobiohydrolase II
               1535    2220     cellobiohydrolase II
    IVS         705     753     cellobiohydrolase II intron
    IVS        1141    1196     cellobiohydrolase II intron
    IVS        1445    1534     cellobiohydrolase II intron
BASE COUNT      647 a    781 c    663 g    683 t
ORIGIN
        1 gaattctagg ctaggtatgc gaggcacgcg gatctagggc agactgggca ttgcatagct
       61 atggtgtagt agaactcccg tcaacggcta ttctcaccta gactttcccc ttcgaactga
      121 caagttgtta tattgcctgt gtaccaagcg ctaatgtgga caggattaat gccagagttc
      181 attagcctca agtagagcct atttcctcgc cggaaagtca tctctcttat tgcatttctg
      241 ccttccacta actcagggtg cagcgcaaca ctacacgcaa catatcacat ttattagccg
      301 tgcaacaagg ctattctacg aaaaatgcta cactccacat gttaaaggcg cattcaacca
      361 gcttctttat tgggtccata cagccaggcg gggatcaagc tcattagccg ccactcaagg
      421 ctatacaatg ttgccaactc tccgggcttt atcctgtgct cccgaatacc acatcgtgat
      481 gatgcttcag cgcacggaag tcacagacac cgcctgtata aaagggggac tgtgaccctg
      541 tatgaggcgc aacatggtct cacagcagct cacctgaaga ggcttgtaag atcaccctct
      601 gtgtattgca ccatgattgt cggcattctc accacgctgg ctacgctggc cacactcgca
      661 gctagtgtgc ctctagagga gcggcaagct tgctcaagcg tctggtaatt atgtgaaccc
      721 tctcaagaga cccaaatact gagatatgtc aaggggccaa tgtggtggcc agaattggtc
      781 gggtccgact tgctgtgctt ccggaagcac atgcgtctac tccaacgact attactccca
      841 gtgtcttccc ggcgctgcaa gctcaagctc gtccacgcgc gccgcgtcga cgacttctcg
      901 agtatccccc acaacatccc ggtcgagctc cgcgacgcct ccacctggtt ctactactac
      961 cagagtacct ccagtcggat cgggaaccgc tacgtattca ggcaaccctt ttgttggggt
     1021 cactccttgg gccaatgcat attacgcctc tgaagttagc agcctcgcta ttcctagctt
     1081 gactggagcc atggccactg ctgcagcagc tgtcgcaaag gttccctctt ttatgtggct
     1141 gtaggtcctc ccggaaccaa ggcaatctgt tactgaaggc tcatcattca ctgcagagat
     1201 actcttgaca agacccctct catggagcaa accttggccg acatccgcac cgccaacaag
     1261 aatggcggta actatgcggg acagtttgtg gtgtatgact tgccggatcg cgattgcgct
     1321 gcccttgcct cgaatggcga atactctatt gccgatggtg gcgtcgccaa atataagaac
     1381 tatatcgaca ccattcgtca aattgtcgtg gaatattccg atatccggac cctcctggtt
     1441 attggtatga gtttaaacac ctgcctcccc ccccccttcc cttcctttcc cgccggcatc
     1501 ttgtcgttgt gctaactatt gttccctctt ccagagcctg actctcttgc caacctggtg
     1561 accaacctcg gtactccaaa gtgtgccaat gctcagtcag cctaccttga gtgcatcaac
     1621 tacgccgtca cacagctgaa ccttccaaat gttgcgatgt atttggacgc tggccatgca
     1681 ggatggcttg gctggccggc aaaccaagac ccggccgctc agctatttgc aaatgtttac
     1741 aagaatgcat cgtctccgag agctcttcgc ggattggcaa ccaatgtcgc caactacaac
     1801 gggtggaaca ttaccagccc cccatcgtac acgcaaggca acgctgtcta caacgagaag
     1861 ctgtacatcc acgctattgg acgtcttctt gccaatcacg gctggtccaa cgccttcttc
     1921 atcactgatc aaggtcgatc gggaaagcag cctaccggac agcaacagtg gggagactgg
     1981 tgcaatgtga tcggcaccgg atttggtatt cgcccatccg caaacactgg ggactcgttg
     2041 ctggattcgt ttgtctgggt caagccaggc ggcgagtgtg acggcaccag cgacagcagt
     2101 gcgccacgat ttgactccca ctgtgcgctc ccagatgcct tgcaaccggc ggctcaagct
     2161 ggtgcttggt tccaagccta ctttgtgcag cttctcacaa acgcaaaccc atcgttcctg
     2221 taaggctttc gtgaccgggc ttcaaacaat gatgtgcgat ggtgtggttc ccggttggcg
     2281 gagtctttgt ctactttggt tgtctgtcgc aggtcggtag accgcaaatg agcaactgat
     2341 ggattgttgc cagcatacta taattcacat ggatggtctt tgtcgatcag tagctagtga
     2401 gagagagaga acatctatcc acaatgtcga gtgtctatta gacatactcc gagaataaag
     2461 tcaacctttc tgtgatctaa agatcgattc ggcagtcgag tagcgtataa caactccgag
     2521 taccagcaaa agcacgtcgt gacaggagcg gctttgccaa ctgcgcaacc ttgcttgaat
     2581 gaggatacac ggggtgcaac atggctgtac tgatccatcg caaccaaaat ttctgtttat
     2641 agatcaagct ggtagattcc aattactcca cctcttgcgc ttctccatga catgtaagtg
     2701 cacgtggaaa ccatacccaa attgcctaca gctgcggagc atgagcctat ggcgatcagt
     2761 ctggtcatgt taac
//
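
As an illustration of how the sigp coordinates (613 to 684) map onto the ORIGIN sequence, the sketch below extracts that span and translates it. It is only a sketch under stated assumptions: the two sequence rows starting at positions 601 and 661 are copied by hand from the record above, the standard genetic code is assumed, and the names ORIGIN_601_720, subsequence and translate are hypothetical helpers, not part of any GenBank tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in t, c, a, g order (assumed here).
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# ORIGIN rows 601 and 661 of the record, copied by hand; blanks are removed.
ORIGIN_601_720 = (
    "gtgtattgca ccatgattgt cggcattctc accacgctgg ctacgctggc cacactcgca"
    "gctagtgtgc ctctagagga gcggcaagct tgctcaagcg tctggtaatt atgtgaaccc"
).replace(" ", "")
FIRST_POSITION = 601  # 1-based coordinate of the first base in the excerpt


def subsequence(start, end):
    """Bases start..end (1-based, inclusive) taken from the excerpt."""
    return ORIGIN_601_720[start - FIRST_POSITION : end - FIRST_POSITION + 1]


def translate(dna):
    """Translate a DNA string codon by codon; trailing partial codons are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i : i + 3]] for i in range(0, usable, 3))


signal_dna = subsequence(613, 684)   # sigp span from the FEATURES table
signal_peptide = translate(signal_dna)
intron_start = subsequence(705, 706)  # first two bases of the first IVS

print(signal_peptide)
print(intron_start)
```

With the excerpt as copied, the span translates to the 24 residues MIVGILTTLATLATLAASVPLEER, and the first intron (IVS 705 to 753) begins with gt.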