[bionet.molbio.genbank.updates] Theileria parva cysteine protease gene, complete cds.

GenBank-Updates@genbank.bio.net (12/04/90)

LOCUS       THECYSPTS    1522 bp ds-DNA             INV       04-DEC-1990
DEFINITION  Theileria parva cysteine protease gene, complete cds.
ACCESSION   M37791
KEYWORDS    cysteine protease; thiol protease.
SOURCE      Theileria parva blood stage, cDNA to mRNA, and DNA.
  ORGANISM  Theileria parva
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
            Piroplasmia; Piroplasmida; Theileriidae.
REFERENCE   1  (bases 1 to 1522)
  AUTHORS   Nene,V., Gobright,E., Musoke,A.J. and Lonsdale-Eccles,J.D.
  TITLE     A single exon codes for the enzyme domain of a protozoan cysteine
            protease
  JOURNAL   J. Biol. Chem. 265, 18047-18050 (1990)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [J. Biol. Chem.
            (1990) In press] kindly submitted by V.Nene, 21-AUG-1990.
FEATURES       from  to/span     description
    pept         96      573     cysteine protease precursor, exon 1
                606     1447     cysteine protease precursor, exon 2
    sigp        225      257     cysteine protease signal peptide
    matp        809     1444     cysteine protease 
    IVS         574      605     cysteine protease intron
BASE COUNT      434 a    290 c    311 g    487 t
ORIGIN      Chromosome four
        1 tttactttgg aatatttcca tattcttcca ttattttaac aggaactgtt tttatttaat
       61 ttttaatttt taatttttgt taatttgtat taaccatggt tagttccgtt gtttctaacc
      121 cgaatgaacg ccttgttaat aacagggtcg agaatgattt ggagtcgtct gatgatactt
      181 tatctactca ggccaagcct gtttctcgtt tattgactag gaaactcctt ttgggtgttg
      241 tcgttttatt ctttttggct ggcgtttccg tcgtttctta ctttctcttt agtaaataca
      301 agatgttaaa taagtttaag agggagttgg atgaccactt gactaaggat tttccgaacc
      361 tagaaaggtc taaacgtgac acttgtttcg acgagttgac tagactcttt ggtgacggtt
      421 tcctatctga cgatcctaaa cttgaatacg aggtttaccg tgaatttgaa gaatttaact
      481 ccaaatacaa cagacgtcat gccactcagc aggagcgtct caacagactc gtcactttcc
      541 gctctaacta tctcgaggtt aaagaacaaa agggttagtt acacacactt aatattattt
      601 tataggtgat gaaccatatg tcaagggtat caacagattc agtgacctca ctgaaagaga
      661 attctacaaa ttgttcccag taatgaaacc accaaaggca acatattcta atggttacta
      721 cctattatcc cacatggcaa acaagactta cctgaagaac ctgaaaaagg ccttaaacac
      781 tgatgaagat gttgacctcg ctaaactcac tggtgagaat cttgactgga ggagatcttc
      841 atcagttact tctgttaagg accagagtaa ctgtggtggc tgctgggcat tctcaactgt
      901 aggttcagtt gagggctatt acatgtctca ctttgacaag agttacgaac tcagtgtcca
      961 agaattattg gactgtgaca gttttagcaa cggatgccaa ggtggtttat tggaatcggc
     1021 ctatgaatat gttagaaagt acggtttagt atcagccaaa gacttgcctt ttgtagacaa
     1081 ggctagaaga tgttccgtac caaaggccaa aaaggtcagt gtaccatcat accatgtttt
     1141 caaggggaaa gaagtcatga ctagatccct cacttcctcg ccctgctctg tatatctatc
     1201 tgtttcaccc gaacttgcca agtataaatc tggtgttttc actggtgaat gcggcaaatc
     1261 acttaaccac gcagttgtgc tggtaggtga aggctacgat gaagtcacca aaaagagata
     1321 ctgggttgta caaaactcct ggggaactga ttggggcgaa aacggctata tgagactaga
     1381 aagaactaac atgggaacgg ataaatgcgg tgttcttgac acctcaatgt ctgcatttga
     1441 actttaataa tatgttttta ttggttctct aacattgctt tttatacctt ggatcgtgtc
     1501 ctatttcttc tttgaagaat tc
//

GenBank-Updates@genbank.bio.net (01/31/91)

LOCUS       THECYSPTS    1522 bp ds-DNA             INV       31-JAN-1991
DEFINITION  Theileria parva cysteine protease gene, complete cds.
ACCESSION   M37791
KEYWORDS    cysteine protease; thiol protease.
SOURCE      Theileria parva blood stage, cDNA to mRNA, and DNA.
  ORGANISM  Theileria parva
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
            Piroplasmia; Piroplasmida; Theileriidae.
REFERENCE   1  (bases 1 to 1522)
  AUTHORS   Nene,V., Gobright,E., Musoke,A.J. and Lonsdale-Eccles,J.D.
  TITLE     A single exon codes for the enzyme domain of a protozoan cysteine
            protease
  JOURNAL   J. Biol. Chem. 265, 18047-18050 (1990)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [J. Biol. Chem.
            (1990) In press] kindly submitted by V.Nene, 21-AUG-1990.
FEATURES             Location/Qualifiers
     CDS             join(96..573,606..1447)
                     /product="cysteine protease"
     sig_peptide     225..257
                     /gene="cysteine protease"
                     /codon_start=225
     intron          574..605
                     /gene="cysteine protease"
     mat_peptide     809..1444
                     /product="cysteine protease"
                     /gene="cysteine protease"
                     /codon_start=809
BASE COUNT      434 a    290 c    311 g    487 t
ORIGIN      Chromosome four
        1 tttactttgg aatatttcca tattcttcca ttattttaac aggaactgtt tttatttaat
       61 ttttaatttt taatttttgt taatttgtat taaccatggt tagttccgtt gtttctaacc
      121 cgaatgaacg ccttgttaat aacagggtcg agaatgattt ggagtcgtct gatgatactt
      181 tatctactca ggccaagcct gtttctcgtt tattgactag gaaactcctt ttgggtgttg
      241 tcgttttatt ctttttggct ggcgtttccg tcgtttctta ctttctcttt agtaaataca
      301 agatgttaaa taagtttaag agggagttgg atgaccactt gactaaggat tttccgaacc
      361 tagaaaggtc taaacgtgac acttgtttcg acgagttgac tagactcttt ggtgacggtt
      421 tcctatctga cgatcctaaa cttgaatacg aggtttaccg tgaatttgaa gaatttaact
      481 ccaaatacaa cagacgtcat gccactcagc aggagcgtct caacagactc gtcactttcc
      541 gctctaacta tctcgaggtt aaagaacaaa agggttagtt acacacactt aatattattt
      601 tataggtgat gaaccatatg tcaagggtat caacagattc agtgacctca ctgaaagaga
      661 attctacaaa ttgttcccag taatgaaacc accaaaggca acatattcta atggttacta
      721 cctattatcc cacatggcaa acaagactta cctgaagaac ctgaaaaagg ccttaaacac
      781 tgatgaagat gttgacctcg ctaaactcac tggtgagaat cttgactgga ggagatcttc
      841 atcagttact tctgttaagg accagagtaa ctgtggtggc tgctgggcat tctcaactgt
      901 aggttcagtt gagggctatt acatgtctca ctttgacaag agttacgaac tcagtgtcca
      961 agaattattg gactgtgaca gttttagcaa cggatgccaa ggtggtttat tggaatcggc
     1021 ctatgaatat gttagaaagt acggtttagt atcagccaaa gacttgcctt ttgtagacaa
     1081 ggctagaaga tgttccgtac caaaggccaa aaaggtcagt gtaccatcat accatgtttt
     1141 caaggggaaa gaagtcatga ctagatccct cacttcctcg ccctgctctg tatatctatc
     1201 tgtttcaccc gaacttgcca agtataaatc tggtgttttc actggtgaat gcggcaaatc
     1261 acttaaccac gcagttgtgc tggtaggtga aggctacgat gaagtcacca aaaagagata
     1321 ctgggttgta caaaactcct ggggaactga ttggggcgaa aacggctata tgagactaga
     1381 aagaactaac atgggaacgg ataaatgcgg tgttcttgac acctcaatgt ctgcatttga
     1441 actttaataa tatgttttta ttggttctct aacattgctt tttatacctt ggatcgtgtc
     1501 ctatttcttc tttgaagaat tc
//
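
The feature locations and base counts in the entry above can be cross-checked with a few lines of Python. This is only a minimal sketch, not part of the GenBank record: the helper names parse_location and span_length are hypothetical, and the parser assumes the simple "a..b" and "join(a..b,c..d)" forms used in this entry (no complement(), no partial "<" or ">" markers).

```python
import re

def parse_location(location):
    """Return 1-based inclusive (start, end) pairs from a simple location string.

    Assumption: only plain 'a..b' spans, optionally wrapped in join(...).
    """
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]

def span_length(segments):
    """Total number of bases covered by a list of (start, end) segments."""
    return sum(end - start + 1 for start, end in segments)

# Locations copied from the FEATURES table of the 31-JAN-1991 entry.
cds = parse_location("join(96..573,606..1447)")
intron = parse_location("574..605")
mat_peptide = parse_location("809..1444")

cds_length = span_length(cds)
intron_length = span_length(intron)
mat_length = span_length(mat_peptide)

# The intron should exactly fill the gap between the two CDS segments.
intron_fills_gap = (cds[0][1] + 1 == intron[0][0]) and (intron[0][1] + 1 == cds[1][0])

# BASE COUNT line: the four counts should add up to the LOCUS length.
base_count = {"a": 434, "c": 290, "g": 311, "t": 487}
total_bases = sum(base_count.values())

print("CDS length:", cds_length, "bp; multiple of 3:", cds_length % 3 == 0)
print("intron length:", intron_length, "bp; fills gap:", intron_fills_gap)
print("mat_peptide length:", mat_length, "bp")
print("base total:", total_bases)
```

Under these assumptions the spliced CDS comes to 1320 bp (a whole number of codons), the intron to 32 bp, the mature peptide region to 636 bp, and the base counts sum to the 1522 bp given on the LOCUS line.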