GenBank-Updates@genbank.bio.net (12/04/90)
LOCUS       THECYSPTS    1522 bp ds-DNA             INV       04-DEC-1990
DEFINITION  Theileria parva cysteine protease gene, complete cds.
ACCESSION   M37791
KEYWORDS    cysteine protease; thiol protease.
SOURCE      Theileria parva blood stage, cDNA to mRNA, and DNA.
  ORGANISM  Theileria parva
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
            Piroplasmia; Piroplasmida; Theileriidae.
REFERENCE   1  (bases 1 to 1522)
  AUTHORS   Nene,V., Gobright,E., Musoke,A.J. and Lonsdale-Eccles,J.D.
  TITLE     A single exon codes for the enzyme domain of a protozoan
            cysteine protease
  JOURNAL   J. Biol. Chem. 265, 18047-18050 (1990)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [J. Biol. Chem.
            (1990) In press] kindly submitted by V.Nene, 21-AUG-1990.
FEATURES    from  to/span    description
    pept      96    573      cysteine protease precursor, exon 1
             606   1447      cysteine protease precursor, exon 2
    sigp     225    257      cysteine protease signal peptide
    matp     809   1444      cysteine protease
    IVS      574    605      cysteine protease intron
BASE COUNT      434 a    290 c    311 g    487 t
ORIGIN      Chromosome four
        1 tttactttgg aatatttcca tattcttcca ttattttaac aggaactgtt tttatttaat
       61 ttttaatttt taatttttgt taatttgtat taaccatggt tagttccgtt gtttctaacc
      121 cgaatgaacg ccttgttaat aacagggtcg agaatgattt ggagtcgtct gatgatactt
      181 tatctactca ggccaagcct gtttctcgtt tattgactag gaaactcctt ttgggtgttg
      241 tcgttttatt ctttttggct ggcgtttccg tcgtttctta ctttctcttt agtaaataca
      301 agatgttaaa taagtttaag agggagttgg atgaccactt gactaaggat tttccgaacc
      361 tagaaaggtc taaacgtgac acttgtttcg acgagttgac tagactcttt ggtgacggtt
      421 tcctatctga cgatcctaaa cttgaatacg aggtttaccg tgaatttgaa gaatttaact
      481 ccaaatacaa cagacgtcat gccactcagc aggagcgtct caacagactc gtcactttcc
      541 gctctaacta tctcgaggtt aaagaacaaa agggttagtt acacacactt aatattattt
      601 tataggtgat gaaccatatg tcaagggtat caacagattc agtgacctca ctgaaagaga
      661 attctacaaa ttgttcccag taatgaaacc accaaaggca acatattcta atggttacta
      721 cctattatcc cacatggcaa acaagactta cctgaagaac ctgaaaaagg ccttaaacac
      781 tgatgaagat gttgacctcg ctaaactcac tggtgagaat cttgactgga ggagatcttc
      841 atcagttact tctgttaagg accagagtaa ctgtggtggc tgctgggcat tctcaactgt
      901 aggttcagtt gagggctatt acatgtctca ctttgacaag agttacgaac tcagtgtcca
      961 agaattattg gactgtgaca gttttagcaa cggatgccaa ggtggtttat tggaatcggc
     1021 ctatgaatat gttagaaagt acggtttagt atcagccaaa gacttgcctt ttgtagacaa
     1081 ggctagaaga tgttccgtac caaaggccaa aaaggtcagt gtaccatcat accatgtttt
     1141 caaggggaaa gaagtcatga ctagatccct cacttcctcg ccctgctctg tatatctatc
     1201 tgtttcaccc gaacttgcca agtataaatc tggtgttttc actggtgaat gcggcaaatc
     1261 acttaaccac gcagttgtgc tggtaggtga aggctacgat gaagtcacca aaaagagata
     1321 ctgggttgta caaaactcct ggggaactga ttggggcgaa aacggctata tgagactaga
     1381 aagaactaac atgggaacgg ataaatgcgg tgttcttgac acctcaatgt ctgcatttga
     1441 actttaataa tatgttttta ttggttctct aacattgctt tttatacctt ggatcgtgtc
     1501 ctatttcttc tttgaagaat tc
//
GenBank-Updates@genbank.bio.net (01/31/91)
LOCUS       THECYSPTS    1522 bp ds-DNA             INV       31-JAN-1991
DEFINITION  Theileria parva cysteine protease gene, complete cds.
ACCESSION   M37791
KEYWORDS    cysteine protease; thiol protease.
SOURCE      Theileria parva blood stage, cDNA to mRNA, and DNA.
  ORGANISM  Theileria parva
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
            Piroplasmia; Piroplasmida; Theileriidae.
REFERENCE   1  (bases 1 to 1522)
  AUTHORS   Nene,V., Gobright,E., Musoke,A.J. and Lonsdale-Eccles,J.D.
  TITLE     A single exon codes for the enzyme domain of a protozoan
            cysteine protease
  JOURNAL   J. Biol. Chem. 265, 18047-18050 (1990)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [J. Biol. Chem.
            (1990) In press] kindly submitted by V.Nene, 21-AUG-1990.
FEATURES             Location/Qualifiers
     CDS             join(96..573,606..1447)
                     /product="cysteine protease"
     sig_peptide     225..257
                     /gene="cysteine protease"
                     /codon_start=225
     intron          574..605
                     /gene="cysteine protease"
     mat_peptide     809..1444
                     /product="cysteine protease"
                     /gene="cysteine protease"
                     /codon_start=809
BASE COUNT      434 a    290 c    311 g    487 t
ORIGIN      Chromosome four
        1 tttactttgg aatatttcca tattcttcca ttattttaac aggaactgtt tttatttaat
       61 ttttaatttt taatttttgt taatttgtat taaccatggt tagttccgtt gtttctaacc
      121 cgaatgaacg ccttgttaat aacagggtcg agaatgattt ggagtcgtct gatgatactt
      181 tatctactca ggccaagcct gtttctcgtt tattgactag gaaactcctt ttgggtgttg
      241 tcgttttatt ctttttggct ggcgtttccg tcgtttctta ctttctcttt agtaaataca
      301 agatgttaaa taagtttaag agggagttgg atgaccactt gactaaggat tttccgaacc
      361 tagaaaggtc taaacgtgac acttgtttcg acgagttgac tagactcttt ggtgacggtt
      421 tcctatctga cgatcctaaa cttgaatacg aggtttaccg tgaatttgaa gaatttaact
      481 ccaaatacaa cagacgtcat gccactcagc aggagcgtct caacagactc gtcactttcc
      541 gctctaacta tctcgaggtt aaagaacaaa agggttagtt acacacactt aatattattt
      601 tataggtgat gaaccatatg tcaagggtat caacagattc agtgacctca ctgaaagaga
      661 attctacaaa ttgttcccag taatgaaacc accaaaggca acatattcta atggttacta
      721 cctattatcc cacatggcaa acaagactta cctgaagaac ctgaaaaagg ccttaaacac
      781 tgatgaagat gttgacctcg ctaaactcac tggtgagaat cttgactgga ggagatcttc
      841 atcagttact tctgttaagg accagagtaa ctgtggtggc tgctgggcat tctcaactgt
      901 aggttcagtt gagggctatt acatgtctca ctttgacaag agttacgaac tcagtgtcca
      961 agaattattg gactgtgaca gttttagcaa cggatgccaa ggtggtttat tggaatcggc
     1021 ctatgaatat gttagaaagt acggtttagt atcagccaaa gacttgcctt ttgtagacaa
     1081 ggctagaaga tgttccgtac caaaggccaa aaaggtcagt gtaccatcat accatgtttt
     1141 caaggggaaa gaagtcatga ctagatccct cacttcctcg ccctgctctg tatatctatc
     1201 tgtttcaccc gaacttgcca agtataaatc tggtgttttc actggtgaat gcggcaaatc
     1261 acttaaccac gcagttgtgc tggtaggtga aggctacgat gaagtcacca aaaagagata
     1321 ctgggttgta caaaactcct ggggaactga ttggggcgaa aacggctata tgagactaga
     1381 aagaactaac atgggaacgg ataaatgcgg tgttcttgac acctcaatgt ctgcatttga
     1441 actttaataa tatgttttta ttggttctct aacattgctt tttatacctt ggatcgtgtc
     1501 ctatttcttc tttgaagaat tc
//
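
The 31-JAN-1991 version of the entry gives the coding region as join(96..573,606..1447). The following is a minimal Python sketch of how such a location string might be read, together with a consistency check on the BASE COUNT line. The names parse_join and spliced_length are hypothetical helpers introduced for illustration and are not part of any GenBank tool; the parser assumes only plain start..end spans, optionally wrapped in join(...), and does not handle complement(), partial (< or >) or remote locations.

```python
import re


def parse_join(location):
    """Return the 1-based inclusive (start, end) spans of a simple location.

    Hypothetical helper: accepts 'a..b' or 'join(a..b,c..d,...)' only.
    """
    inner = location.strip()
    match = re.fullmatch(r"join\((.*)\)", inner)
    if match:
        inner = match.group(1)
    spans = []
    for part in inner.split(","):
        start, end = part.strip().split("..")
        spans.append((int(start), int(end)))
    return spans


def spliced_length(spans):
    """Total number of bases covered by the spans (ends are inclusive)."""
    return sum(end - start + 1 for start, end in spans)


# Values copied from the entry above.
cds = parse_join("join(96..573,606..1447)")
base_count = {"a": 434, "c": 290, "g": 311, "t": 487}

print(cds)                       # [(96, 573), (606, 1447)]
print(spliced_length(cds))       # 1320
print(sum(base_count.values()))  # 1522
```

The spliced length of 1320 bases is a multiple of three, the intron at 574..605 exactly fills the gap between the two exons, and the base counts sum to the 1522 bp given on the LOCUS line.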