GenBank-Updates@genbank.bio.net (01/16/91)
LOCUS HAECA22 2410 bp ds-DNA INV 16-JAN-1991
DEFINITION H.contortus cathepsin B-like cysteine protease gene, exons 5 - 12.
ACCESSION M60213 M34859 M34860
KEYWORDS cathepsin B-like cysteine protease.
SEGMENT 2 of 2
SOURCE H.contortus (isolate BPL1) adult DNA.
ORGANISM Haemonchus contortus
Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria;
Spirurida; Spirurina; Filarioidea; Filariidae.
REFERENCE 1 (bases 1 to 2410)
AUTHORS Pratt,D., Cox,G.N., Milhausen,M.J. and Boisvenue,R.J.
TITLE A developmentally regulated cysteine protease gene family in
Haemonchus contortus
JOURNAL Mol. Biochem. Parasitol. 43, 181-192 (1990)
STANDARD full staff_entry
FEATURES Location/Qualifiers
intron order(M60212:738..767,1..23)
/note="5.7 kb gap"
/number=4
/gene="AC-2"
CDS join(M60212:262..264,
M60212:345..405,M60212:475..560,M60212:619..737,
24..88,157..230,288..343,423..504,
562..712,1257..1384,1571..1688,1892..1977)
/number=11
/gene="AC-2"
/product="cathepsin B-like cysteine protease"
exon 24..88
/number=5
/gene="AC-2"
/codon_start=24
intron 89..156
/number=5
/gene="AC-2"
exon 157..230
/number=6
/gene="AC-2"
/codon_start=157
intron 231..287
/number=6
/gene="AC-2"
exon 288..343
/number=7
/gene="AC-2"
/codon_start=288
intron 344..422
/number=7
/gene="AC-2"
exon 423..504
/number=8
/gene="AC-2"
/codon_start=423
intron 505..561
/number=8
/gene="AC-2"
exon 562..712
/number=9
/gene="AC-2"
/codon_start=562
intron 713..1041
/number=9
/gene="AC-2"
exon 1257..1384
/number=10
/gene="AC-2"
/codon_start=1257
intron 1385..1570
/note="5' end no splice consensus"
/number=10
/gene="AC-2"
exon 1571..1688
/number=11
/gene="AC-2"
/codon_start=1571
intron 1689..1891
/number=11
/gene="AC-2"
BASE COUNT 752 a 474 c 533 g 651 t
ORIGIN
1 ggttaaagaa aatagattta cagctacgat cctcgagacg tctggaaaaa ctgcacaacg
61 ttctatattc gcgaccaagc caactgcggt gagctcagct gcaagagata ccaatgactc
121 cgaaaaattc aattcggaaa gagttagaat tttcaggctc atgttgggct gtttccacgg
181 cagctgcaat ttcggatcgc atttgcattg caagcaaagc tgaaaaacag gtgcaagtta
241 atctgtgata taataagcaa tcgtttactt caatgtgcaa catttaggtg aatatttctg
301 ccactgacat catgacgtgc tgcaggccac agtgcggtga cgggtaaaat ttcgtagtat
361 tgactctcag agcacggaaa taagtatata gccccatttg tcttaataat tcattgtttc
421 aggtgtgaag gaggatggcc tatcgaagct tggaaatact tcatatatga cggcgttgtt
481 tctggaggag aatacctcac taaagtgaga taactattcc taatattatt tgccgaaatg
541 ttctagcagt cttgacatta ggatgtatgc cgcccttatc caattcaccc atgtggacat
601 cacggaaacg acacctacta cggggaatgc cgtggaacag cgccaacccc accgtgcaaa
661 aggaaatgcc ggcccggcgt taggaaaatg tacaggatag acaagcgata cggtaagcgt
721 aagatctatc agatagtaac gttagaaatg cagtatctga aaggaagctt ggtccttctt
781 caaagcaaaa aataaatgca aacaagtaaa caaaacttag agaaattgtt gctattagtc
841 atgacagtag aagcaggtca ctcgggtatt aacagcacgt cacgaggccg attggagaag
901 tttttgttga aaaaatatgt ggggatctgt gatctttaca atctaatctt cttttctcag
961 caaatgatgg aagcgattag tgcagcatgt taaagacaga gttcacccgg atgtttgaca
1021 ctcgaatcat aaattggaaa ccacacaggt ttccaattta tgattcgaaa acacacgaaa
1081 ccacacacaa ccacatgagg ttcttcagtt agctattcat ttgtgaaacg tgactaaata
1141 cattgcctcg gcttcttgta tgatgtggtg tttatcaaaa gttaaccaat gatctgccgc
1201 ttcatgatat gcgcttgaac ggatttcttc gagcttcgta gagattgaat ttttaggaaa
1261 agacgcctac atcgtaaaac agtcggttaa agccattcag agtgaaatac taaagaatgg
1321 accggttgtg gcttcgtttg ccgtctatga agatttcagg cactacaaat caggaattta
1381 taaggcgagt cttctaaata aacgtcagtt ctaactgtta gtattcgcaa gcaacagctt
1441 gccattgtga attcgcttgg aaaaaagagc tcattgaaaa tgcaatctat atgcatccag
1501 atatgatcta tacaatctgc acagaaagtt gcggaaaaga aaatagaatg tcagcaagag
1561 atgattatag cacacagctg gtgagctacg agggtaccat gctgtaaaga tgattggatg
1621 gggaaatgaa aataatacag acttctggct cattgccaac tcttggcaca acgattgggg
1681 agaaaaaggt aagtcactca tgccaaaagt gtttgtacat ttgtactttg cttggttgtc
1741 gcagttttct tacgcgaccc atagattagg ccagtacctg aagcaacgcc gccaccatac
1801 accacgccgg ctcagaaaca gcagaaacac gctgcttctg tccatttttc gagagaggta
1861 agactatggt ccaatcgata gctattttca ggatatttcc gcatagttcg cggaagtaac
1921 gactgtggaa ttgaaggaac catcgccgct gggattgtcg acacagaaag tctatgatat
1981 tgcaccatgt cacagcaaaa tgaataaatt aactaatcat atgtgcatgt agaatcgtac
2041 tttgaggaag cctctgtaat gtattcgatg aaagctgggt tacatcgtat tgcacgtgct
2101 gaggagccta attggatttt ttgacgatgt tcaacacata gatgatcgca ggggctgtat
2161 tgctactgct cactgaagat ggttgaacat acagacgcgc tcagaatatt cggcgaaatg
2221 cttcgtcagt tatgcttttc agtcgtaact cctttcgtac tgaacctaat catgcgaaat
2281 ttcaatatgt gcagcttggg tgtagtgtga atatctacca ctattaacct gcaatctgtt
2341 catgttccag atatcaaaaa cggcttgaaa aatcagctgg aaacacgaac gatggtcctt
2401 ctagaattct
//