GenBank-Updates@genbank.bio.net (10/23/90)
LOCUS TRRCBHIIQ 2774 bp ds-DNA PLN 23-OCT-1990
DEFINITION Trichoderma reesei cellobiohydrolase II gene, complete cds.
ACCESSION M55080
KEYWORDS cellobiohydrolase II.
SOURCE Trichoderma reesei (strain QM9414) DNA.
ORGANISM Trichoderma reesei
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Pyrenomycetes; Hypocreales; Hypocreaceae.
REFERENCE 1 (bases 1 to 2774)
AUTHORS Chen,C.M., Gritzali,M. and Stafford,D.W.
TITLE Nucleotide sequence and deduced primary structure of
cellobiohydrolase II from Trichoderma reesei
JOURNAL Bio/Technology 5, 274-278 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept 613 704 cellobiohydrolase II, exon 1
754 1140 cellobiohydrolase II, exon 2
1197 1444 cellobiohydrolase II, exon 3
1535 2223 cellobiohydrolase II, exon 4
sigp 613 684 cellobiohydrolase II signal peptide
matp 685 704 cellobiohydrolase II
754 1140 cellobiohydrolase II
1197 1444 cellobiohydrolase II
1535 2220 cellobiohydrolase II
IVS 705 753 cellobiohydrolase II intron
IVS 1141 1196 cellobiohydrolase II intron
IVS 1445 1534 cellobiohydrolase II intron
BASE COUNT 647 a 781 c 663 g 683 t
ORIGIN
1 gaattctagg ctaggtatgc gaggcacgcg gatctagggc agactgggca ttgcatagct
61 atggtgtagt agaactcccg tcaacggcta ttctcaccta gactttcccc ttcgaactga
121 caagttgtta tattgcctgt gtaccaagcg ctaatgtgga caggattaat gccagagttc
181 attagcctca agtagagcct atttcctcgc cggaaagtca tctctcttat tgcatttctg
241 ccttccacta actcagggtg cagcgcaaca ctacacgcaa catatcacat ttattagccg
301 tgcaacaagg ctattctacg aaaaatgcta cactccacat gttaaaggcg cattcaacca
361 gcttctttat tgggtccata cagccaggcg gggatcaagc tcattagccg ccactcaagg
421 ctatacaatg ttgccaactc tccgggcttt atcctgtgct cccgaatacc acatcgtgat
481 gatgcttcag cgcacggaag tcacagacac cgcctgtata aaagggggac tgtgaccctg
541 tatgaggcgc aacatggtct cacagcagct cacctgaaga ggcttgtaag atcaccctct
601 gtgtattgca ccatgattgt cggcattctc accacgctgg ctacgctggc cacactcgca
661 gctagtgtgc ctctagagga gcggcaagct tgctcaagcg tctggtaatt atgtgaaccc
721 tctcaagaga cccaaatact gagatatgtc aaggggccaa tgtggtggcc agaattggtc
781 gggtccgact tgctgtgctt ccggaagcac atgcgtctac tccaacgact attactccca
841 gtgtcttccc ggcgctgcaa gctcaagctc gtccacgcgc gccgcgtcga cgacttctcg
901 agtatccccc acaacatccc ggtcgagctc cgcgacgcct ccacctggtt ctactactac
961 cagagtacct ccagtcggat cgggaaccgc tacgtattca ggcaaccctt ttgttggggt
1021 cactccttgg gccaatgcat attacgcctc tgaagttagc agcctcgcta ttcctagctt
1081 gactggagcc atggccactg ctgcagcagc tgtcgcaaag gttccctctt ttatgtggct
1141 gtaggtcctc ccggaaccaa ggcaatctgt tactgaaggc tcatcattca ctgcagagat
1201 actcttgaca agacccctct catggagcaa accttggccg acatccgcac cgccaacaag
1261 aatggcggta actatgcggg acagtttgtg gtgtatgact tgccggatcg cgattgcgct
1321 gcccttgcct cgaatggcga atactctatt gccgatggtg gcgtcgccaa atataagaac
1381 tatatcgaca ccattcgtca aattgtcgtg gaatattccg atatccggac cctcctggtt
1441 attggtatga gtttaaacac ctgcctcccc ccccccttcc cttcctttcc cgccggcatc
1501 ttgtcgttgt gctaactatt gttccctctt ccagagcctg actctcttgc caacctggtg
1561 accaacctcg gtactccaaa gtgtgccaat gctcagtcag cctaccttga gtgcatcaac
1621 tacgccgtca cacagctgaa ccttccaaat gttgcgatgt atttggacgc tggccatgca
1681 ggatggcttg gctggccggc aaaccaagac ccggccgctc agctatttgc aaatgtttac
1741 aagaatgcat cgtctccgag agctcttcgc ggattggcaa ccaatgtcgc caactacaac
1801 gggtggaaca ttaccagccc cccatcgtac acgcaaggca acgctgtcta caacgagaag
1861 ctgtacatcc acgctattgg acgtcttctt gccaatcacg gctggtccaa cgccttcttc
1921 atcactgatc aaggtcgatc gggaaagcag cctaccggac agcaacagtg gggagactgg
1981 tgcaatgtga tcggcaccgg atttggtatt cgcccatccg caaacactgg ggactcgttg
2041 ctggattcgt ttgtctgggt caagccaggc ggcgagtgtg acggcaccag cgacagcagt
2101 gcgccacgat ttgactccca ctgtgcgctc ccagatgcct tgcaaccggc ggctcaagct
2161 ggtgcttggt tccaagccta ctttgtgcag cttctcacaa acgcaaaccc atcgttcctg
2221 taaggctttc gtgaccgggc ttcaaacaat gatgtgcgat ggtgtggttc ccggttggcg
2281 gagtctttgt ctactttggt tgtctgtcgc aggtcggtag accgcaaatg agcaactgat
2341 ggattgttgc cagcatacta taattcacat ggatggtctt tgtcgatcag tagctagtga
2401 gagagagaga acatctatcc acaatgtcga gtgtctatta gacatactcc gagaataaag
2461 tcaacctttc tgtgatctaa agatcgattc ggcagtcgag tagcgtataa caactccgag
2521 taccagcaaa agcacgtcgt gacaggagcg gctttgccaa ctgcgcaacc ttgcttgaat
2581 gaggatacac ggggtgcaac atggctgtac tgatccatcg caaccaaaat ttctgtttat
2641 agatcaagct ggtagattcc aattactcca cctcttgcgc ttctccatga catgtaagtg
2701 cacgtggaaa ccatacccaa attgcctaca gctgcggagc atgagcctat ggcgatcagt
2761 ctggtcatgt taac
//GenBank-Updates@genbank.bio.net (10/24/90)
LOCUS TRRCBHIIQ 2774 bp ds-DNA PLN 24-OCT-1990
DEFINITION Trichoderma reesei cellobiohydrolase II gene, complete cds.
ACCESSION M55080
KEYWORDS cellobiohydrolase II.
SOURCE Trichoderma reesei (strain QM9414) DNA.
ORGANISM Trichoderma reesei
Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
Pyrenomycetes; Hypocreales; Hypocreaceae.
REFERENCE 1 (bases 1 to 2774)
AUTHORS Chen,C.M., Gritzali,M. and Stafford,D.W.
TITLE Nucleotide sequence and deduced primary structure of
cellobiohydrolase II from Trichoderma reesei
JOURNAL Bio/Technology 5, 274-278 (1987)
STANDARD simple staff_entry
FEATURES from to/span description
pept 613 704 cellobiohydrolase II, exon 1
754 1140 cellobiohydrolase II, exon 2
1197 1444 cellobiohydrolase II, exon 3
1535 2223 cellobiohydrolase II, exon 4
sigp 613 684 cellobiohydrolase II signal peptide
matp 685 704 cellobiohydrolase II
754 1140 cellobiohydrolase II
1197 1444 cellobiohydrolase II
1535 2220 cellobiohydrolase II
IVS 705 753 cellobiohydrolase II intron
IVS 1141 1196 cellobiohydrolase II intron
IVS 1445 1534 cellobiohydrolase II intron
BASE COUNT 647 a 781 c 663 g 683 t
ORIGIN
1 gaattctagg ctaggtatgc gaggcacgcg gatctagggc agactgggca ttgcatagct
61 atggtgtagt agaactcccg tcaacggcta ttctcaccta gactttcccc ttcgaactga
121 caagttgtta tattgcctgt gtaccaagcg ctaatgtgga caggattaat gccagagttc
181 attagcctca agtagagcct atttcctcgc cggaaagtca tctctcttat tgcatttctg
241 ccttccacta actcagggtg cagcgcaaca ctacacgcaa catatcacat ttattagccg
301 tgcaacaagg ctattctacg aaaaatgcta cactccacat gttaaaggcg cattcaacca
361 gcttctttat tgggtccata cagccaggcg gggatcaagc tcattagccg ccactcaagg
421 ctatacaatg ttgccaactc tccgggcttt atcctgtgct cccgaatacc acatcgtgat
481 gatgcttcag cgcacggaag tcacagacac cgcctgtata aaagggggac tgtgaccctg
541 tatgaggcgc aacatggtct cacagcagct cacctgaaga ggcttgtaag atcaccctct
601 gtgtattgca ccatgattgt cggcattctc accacgctgg ctacgctggc cacactcgca
661 gctagtgtgc ctctagagga gcggcaagct tgctcaagcg tctggtaatt atgtgaaccc
721 tctcaagaga cccaaatact gagatatgtc aaggggccaa tgtggtggcc agaattggtc
781 gggtccgact tgctgtgctt ccggaagcac atgcgtctac tccaacgact attactccca
841 gtgtcttccc ggcgctgcaa gctcaagctc gtccacgcgc gccgcgtcga cgacttctcg
901 agtatccccc acaacatccc ggtcgagctc cgcgacgcct ccacctggtt ctactactac
961 cagagtacct ccagtcggat cgggaaccgc tacgtattca ggcaaccctt ttgttggggt
1021 cactccttgg gccaatgcat attacgcctc tgaagttagc agcctcgcta ttcctagctt
1081 gactggagcc atggccactg ctgcagcagc tgtcgcaaag gttccctctt ttatgtggct
1141 gtaggtcctc ccggaaccaa ggcaatctgt tactgaaggc tcatcattca ctgcagagat
1201 actcttgaca agacccctct catggagcaa accttggccg acatccgcac cgccaacaag
1261 aatggcggta actatgcggg acagtttgtg gtgtatgact tgccggatcg cgattgcgct
1321 gcccttgcct cgaatggcga atactctatt gccgatggtg gcgtcgccaa atataagaac
1381 tatatcgaca ccattcgtca aattgtcgtg gaatattccg atatccggac cctcctggtt
1441 attggtatga gtttaaacac ctgcctcccc ccccccttcc cttcctttcc cgccggcatc
1501 ttgtcgttgt gctaactatt gttccctctt ccagagcctg actctcttgc caacctggtg
1561 accaacctcg gtactccaaa gtgtgccaat gctcagtcag cctaccttga gtgcatcaac
1621 tacgccgtca cacagctgaa ccttccaaat gttgcgatgt atttggacgc tggccatgca
1681 ggatggcttg gctggccggc aaaccaagac ccggccgctc agctatttgc aaatgtttac
1741 aagaatgcat cgtctccgag agctcttcgc ggattggcaa ccaatgtcgc caactacaac
1801 gggtggaaca ttaccagccc cccatcgtac acgcaaggca acgctgtcta caacgagaag
1861 ctgtacatcc acgctattgg acgtcttctt gccaatcacg gctggtccaa cgccttcttc
1921 atcactgatc aaggtcgatc gggaaagcag cctaccggac agcaacagtg gggagactgg
1981 tgcaatgtga tcggcaccgg atttggtatt cgcccatccg caaacactgg ggactcgttg
2041 ctggattcgt ttgtctgggt caagccaggc ggcgagtgtg acggcaccag cgacagcagt
2101 gcgccacgat ttgactccca ctgtgcgctc ccagatgcct tgcaaccggc ggctcaagct
2161 ggtgcttggt tccaagccta ctttgtgcag cttctcacaa acgcaaaccc atcgttcctg
2221 taaggctttc gtgaccgggc ttcaaacaat gatgtgcgat ggtgtggttc ccggttggcg
2281 gagtctttgt ctactttggt tgtctgtcgc aggtcggtag accgcaaatg agcaactgat
2341 ggattgttgc cagcatacta taattcacat ggatggtctt tgtcgatcag tagctagtga
2401 gagagagaga acatctatcc acaatgtcga gtgtctatta gacatactcc gagaataaag
2461 tcaacctttc tgtgatctaa agatcgattc ggcagtcgag tagcgtataa caactccgag
2521 taccagcaaa agcacgtcgt gacaggagcg gctttgccaa ctgcgcaacc ttgcttgaat
2581 gaggatacac ggggtgcaac atggctgtac tgatccatcg caaccaaaat ttctgtttat
2641 agatcaagct ggtagattcc aattactcca cctcttgcgc ttctccatga catgtaagtg
2701 cacgtggaaa ccatacccaa attgcctaca gctgcggagc atgagcctat ggcgatcagt
2761 ctggtcatgt taac
//