[bionet.molbio.genbank.updates] Hepatitis C virus core, matrix, envelope and non-structural protein RNA

GenBank-Updates@genbank.bio.net (03/29/91)

LOCUS       HPCHUMR      9416 bp ss-RNA             VRL       29-MAR-1991
DEFINITION  Hepatitis C virus core, matrix, envelope and non-structural protein
            RNA.
ACCESSION   M58335
KEYWORDS    core protein; envelope protein; matrix protein;
            non-structural protein.
SOURCE      Hepatitis C virus isolated from human plasma, cDNA to genomic RNA.
  ORGANISM  Hepatitis C virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
            Togaviridae incertae sedis.
REFERENCE   1  (bases 1 to 9416)
  AUTHORS   Takamizawa,A., Mori,C., Manabe,S., Murakami,S., Fujita,J.,
            Onishi,E., Andoh,T., Yoshida,I. and Okayama,H.
  TITLE     The structure and organization of the Hepatitis C virus genome
            isolated from human carriers
  JOURNAL   J. Virol. 65, 1105-1113 (1991)
  STANDARD  simple staff_entry
FEATURES             Location/Qualifiers
     CDS             333..9365
                     /label=ORF
                     /codon_start=333
     mat_peptide     336..677
                     /product="core protein"
                     /codon_start=336
     mat_peptide     678..905
                     /product="matrix protein"
                     /codon_start=678
     mat_peptide     906..1499
                     /product="envelope protein"
                     /codon_start=906
     mat_peptide     1500..2519
                     /product="non-structural protein"
                     /label=NS1
                     /codon_start=1500
     mat_peptide     2520..3350
                     /product="non-structural protein"
                     /label=NS2
                     /codon_start=2520
     mat_peptide     3351..5177
                     /product="non-structural protein"
                     /label=NS3
                     /codon_start=3351
     mat_peptide     5178..5918
                     /product="non-structural protein"
                     /label=NS4a
                     /codon_start=5178
     mat_peptide     5919..6371
                     /product="non-structural protein"
                     /label=NS4b
                     /codon_start=5919
     mat_peptide     6372..9362
                     /product="non-structural protein"
                     /codon_start=6372
BASE COUNT     1905 a   2825 c   2679 g   2007 t
ORIGIN
        1 cgattggggg cgacactcca ccatagatca ctcccctgtg aggaactact gtcttcacgc
       61 agaaagcgtc tagccatggc gttagtatga gtgtcgtgca gcctccagga ccccccctcc
      121 cgggagagcc atagtggtct gcggaaccgg tgagtacacc ggaattgcca ggacgaccgg
      181 gtcctttctt ggatcaaccc gctcaatgcc tggagatttg ggcgtgcccc cgcgagactg
      241 ctagccgagt agtgttgggt cgcgaaaggc cttgtggtac tgcctgatag ggtgcttgcg
      301 agtgccccgg gaggtctcgt agaccgtgca ccatgagcac gaatcctaaa cctcaaagaa
      361 aaaccaaacg taacaccaac cgccgcccac aggacgtcaa gttcccgggc ggtggtcaga
      421 tcgttggtgg agtttacctg ttgccgcgca ggggccccag gttgggtgtg cgcgcgccca
      481 ggaagacttc cgagcggtcg caacctcgtg gaaggcgaca acctatcccc aaggctcgcc
      541 ggcccgaggg caggacctgg gctcagcccg ggtacccttg gcctctctat ggcaatgagg
      601 gcttagggtg ggcaggatgg ctcctgtcac cccgcggctc ccggcctagt tggggcccca
      661 cggacccccg gcgtaggtcg cgtaatttgg gtaaggtcat cgataccctc acatgcggct
      721 tcgccgatct catggggtac attccgctcg tcggcgcccc cctggggggc gctgccaggg
      781 ccctggcaca tggtgtccgg gttctggagg acggcgtgaa ctatgcaaca gggaatctgc
      841 ccggttgctc tttttctatc ttcctcttgg ctctgctgtc ctgcctgacc accccagctt
      901 ccgcttacga agtgcacaac gtgtccggga tatatcatgt cacgaacgac tgctccaacg
      961 caagcattgt gtatgaggca gcggacttga tcatgcatac tcctgggtgc gtgccctgcg
     1021 ttcgggaagg caactcctcc cgctgctggg tagcgctcac tcccacgctc gcagccagga
     1081 acgtcaccat ccccaccacg acgatacgac gccacgtcga tctgctcgtt ggggcggctg
     1141 ctttctgttc cgctatgtac gtgggggacc tctgcggatc tgttttcctc gtctctcagc
     1201 tgttcacctt ctcgcctcgc cggcatgtga cattacagga ctgtaactgc tcaatttatc
     1261 ccggccatgt gtcgggtcac cgtatggctt gggacatgat gatgaactgg tcgcccacaa
     1321 cagccctagt ggtgtcgcag ttactccgga tcccacaagc cgtcgtggac atggtggcgg
     1381 gggcccactg gggagtcctg gcgggccttg cctactattc catggcgggg aactgggcta
     1441 aggttctgat tgtgatgcta ctttttgctg gcgttgacgg ggatacccac gtgacagggg
     1501 gggcgcaagc caaaaccacc aacaggctcg tgtccatgtt cgcaagtggg ccgtctcaga
     1561 aaatccagct tataaacacc aatgggagtt ggcacatcaa caggactgcc ctgaactgca
     1621 atgactctct ccagactggg tttcttgccg cgctgttcta cacacatagt ttcaactcgt
     1681 ccgggtgccc agagcgcatg gcccagtgcc gcaccattga caagttcgac cagggatggg
     1741 gtcccattac ttatgctgag tctagcagat cagaccagag gccatattgc tggcactacc
     1801 cacctccaca atgtaccatc gtacctgcgt cggaggtgtg cggcccagtg tactgcttca
     1861 ccccaagccc tgtcgtcgtg gggacgaccg atcgtttcgg tgtccctacg tatagatggg
     1921 gggagaacga gactgacgtg ctgctgctca acaacacgcg gccgccgcaa ggcaactggt
     1981 tcggctgcac atggatgaat agcaccgggt tcaccaagac atgtgggggg cccccgtgta
     2041 acatcggggg ggtcggcaac aacaccctga cctgccccac ggactgcttc cggaagcacc
     2101 ccgaggctac ctacacaaaa tgtggttcgg ggccttggct gacacctagg tgcatggttg
     2161 actatccata caggctctgg cattacccct gcactgttaa ctttaccatc ttcaaggtta
     2221 ggatgtatgt ggggggggtg gagcacaggc tcaatgctgc atgcaattgg acccgaggag
     2281 agcgttgtga cttggaggac agggataggc cggagctcag cccgctgctg ctgtctacaa
     2341 cagagtggca ggtactgccc tgttccttca ccaccctacc agctctgtcc actggcttga
     2401 ttcacctcca tcagaacatc gtggacgtgc aatacctata cggtataggg tcagcggttg
     2461 tctcctttgc aatcaaatgg gagtatgtcc tgttgctttt ccttctccta gcggacgcac
     2521 gtgtctgtgc ctgcttgtgg atgatgctgc tgatagccca ggccgaggcc gccttggaga
     2581 acctggtggt cctcaattcg gcgtctgtgg ccggcgcaca tggcatcctc tccttccttg
     2641 tgttcttctg tgccgcctgg tacatcaaag gcaggctggt ccctggggcg acatatgctc
     2701 tttatggcgt gtggccgctg ctcctgctct tgctggcatt accaccgcga gcttacgcca
     2761 tggaccggga gatggctgca tcgtgcggag gcgcggtttt tgtgggtctg gtactcctga
     2821 ctttgtcacc atactacaag gtgttcctcg ctaggctcat atggtggtta caatatttta
     2881 ccaccagagc cgaggcggac ttacatgtgt ggatcccccc cctcaacgct cggggaggcc
     2941 gcgatgccat catcctcctc atgtgcgcag tccatccaga gctaatcttt gacatcacca
     3001 aacttctaat tgccatactc ggtccgctca tggtgctcca agctggcata accagagtgc
     3061 cgtacttcgt gcgcgctcaa gggctcattc atgcatgcat gttagtgcgg aaggtcgctg
     3121 ggggtcatta tgtccaaatg gccttcatga agctgggcgc gctgacaggc acgtacattt
     3181 acaaccatct taccccgcta cgggattggc cacgcgcggg cctacgagac cttgcggtgg
     3241 cagtggagcc cgtcgtcttc tccgacatgg agaccaagat catcacctgg ggagcagaca
     3301 ccgcggcgtg tggggacatc atcttgggtc tgcccgtctc cgcccgaagg ggaaaggaga
     3361 tactcctggg cccggccgat agtcttgaag ggcgggggtt gcgactcctc gcgcccatca
     3421 cggcctactc ccaacagacg cggggcctac ttggttgcat catcactagc cttacaggcc
     3481 gggacaagaa ccaggtcgag ggagaggttc aggtggtttc caccgcaaca caatccttcc
     3541 tggcgacctg cgtcaacggc gtgtgttgga ccgtttacca tggtgctggc tcaaagacct
     3601 tagccgcgcc aaaggggcca atcacccaga tgtacactaa tgtggaccag gacctcgtcg
     3661 gctggcccaa gccccccggg gcgcgttcct tgacaccatg cacctgtggc agctcagacc
     3721 tttacttggt cacgagacat gctgacgtca ttccggtgcg ccggcggggc gacagtaggg
     3781 ggagcctgct ctcccccagg cctgtctcct acttgaaggg ctcttcgggt ggtccactgc
     3841 tctgcccctt cgggcacgct gtgggcatct tccgggctgc cgtatgcacc cggggggttg
     3901 cgaaggcggt ggactttgtg cccgtagagt ccatggaaac tactatgcgg tctccggtct
     3961 tcacggacaa ctcatccccc ccggccgtac cgcagtcatt tcaagtggcc cacctacacg
     4021 ctcccactgg cagcggcaag agtactaaag tgccggctgc atatgcagcc caagggtaca
     4081 aggtgctcgt cctcaatccg tccgttgccg ctaccttagg gtttggggcg tatatgtcta
     4141 aggcacacgg tattgacccc aacatcagaa ctggggtaag gaccattacc acaggcgccc
     4201 ccgtcacata ctctacctat ggcaagtttc ttgccgatgg tggttgctct gggggcgctt
     4261 atgacatcat aatatgtgat gagtgccatt caactgactc gactacaatc ttgggcatcg
     4321 gcacagtcct ggaccaagcg gagacggctg gagcgcggct tgtcgtgctc gccaccgcta
     4381 cgcctccggg atcggtcacc gtgccacacc caaacatcga ggaggtggcc ctgtctaata
     4441 ctggagagat ccccttctat ggcaaagcca tccccattga agccatcagg gggggaaggc
     4501 atctcatttt ctgtcattcc aagaagaagt gcgacgagct cgccgcaaag ctgtcaggcc
     4561 tcggaatcaa cgctgtggcg tattaccggg ggctcgatgt gtccgtcata ccaactatcg
     4621 gagacgtcgt tgtcgtggca acagacgctc tgatgacggg ctatacgggc gactttgact
     4681 cagtgatcga ctgtaacaca tgtgtcaccc agacagtcga cttcagcttg gatcccacct
     4741 tcaccattga gacgacgacc gtgcctcaag acgcagtgtc gcgctcgcag cggcggggta
     4801 ggactggcag gggtaggaga ggcatctaca ggtttgtgac tccgggagaa cggccctcgg
     4861 gcatgttcga ttcctcggtc ctgtgtgagt gctatgacgc gggctgtgct tggtacgagc
     4921 tcaccccggc cgagacctcg gttaggttgc gggcctacct gaacacacca gggttgcccg
     4981 tttgccagga ccacctggag ttctgggaga gtgtcttcac aggcctcacc catatagatg
     5041 cacacttctt gtcccagacc aagcaggcag gagacaactt cccctacctg gtagcatacc
     5101 aagccacggt gtgcgccagg gctcaggccc cacctccatc atgggatcaa atgtggaagt
     5161 gtctcatacg gctgaaacct acgctgcacg ggccaacacc cttgctgtac aggctgggag
     5221 ccgtccagaa tgaggtcacc ctcacccacc ccataaccaa atacatcatg gcatgcatgt
     5281 cggctgacct ggaggtcgtc actagcacct gggtgctggt gggcggagtc cttgcagctc
     5341 tggccgcgta ttgcctgaca acaggcagtg tggtcattgt gggtaggatt atcttgtccg
     5401 ggaggccggc cattgttccc gacagggagc ttctctacca ggagttcgat gaaatggaag
     5461 agtgcgcctc gcacctccct tacatcgagc agggaatgca gctcgccgag caattcaagc
     5521 agaaagcgct cgggttactg caaacagcca ccaaacaagc ggaggctgct gctcccgtgg
     5581 tggagtccaa gtggcgagcc cttgagacat tctgggcgaa gcacatgtgg aatttcatca
     5641 gcgggataca gtacttagca ggcttatcca ctctgcctgg gaaccccgca atagcatcat
     5701 tgatggcatt cacagcctct atcaccagcc cgctcaccac ccaaagtacc ctcctgttta
     5761 acatcttggg ggggtgggtg gctgcccaac tcgccccccc cagcgccgct tcggctttcg
     5821 tgggcgccgg catcgccggt gcggctgttg gcagcatagg ccttgggaag gtgcttgtgg
     5881 acattctggc gggttatgga gcaggagtgg ccggcgcgct cgtggccttt aaggtcatga
     5941 gcggcgagat gccctccacc gaggacctgg tcaatctact tcctgccatc ctctctcctg
     6001 gcgccctggt cgtcggggtc gtgtgtgcag caatactgcg tcgacacgtg ggtccgggag
     6061 agggggctgt gcagtggatg aaccggctga tagcgttcgc ctcgcggggt aatcatgttt
     6121 cccccacgca ctatgtgcct gagagcgacg ccgcagcgcg tgttactcag atcctctcca
     6181 gccttaccat cactcagctg ctgaaaaggc tccaccagtg gattaatgaa gactgctcca
     6241 caccgtgttc cggctcgtgg ctaagggatg tttgggactg gatatgcacg gtgttgactg
     6301 acttcaagac ctggctccag tccaagctcc tgccgcagct acctggagtc ccttttttct
     6361 cgtgccaacg cgggtacaag ggagtctggc ggggagacgg catcatgcaa accacctgcc
     6421 catgtggagc acagatcacc ggacatgtca aaaacggttc catgaggatc gtcgggccta
     6481 agacctgcag caacacgtgg catggaacat tccccatcaa cgcatacacc acgggcccct
     6541 gcacaccctc tccagcgcca aactattcta gggcgctgtg gcgggtggcc gctgaggagt
     6601 acgtggaggt cacgcgggtg ggggatttcc actacgtgac gggcatgacc actgacaacg
     6661 taaagtgccc atgccaggtt ccggctcctg aattcttctc ggaggtggac ggagtgcggt
     6721 tgcacaggta cgctccggcg tgcaggcctc tcctacggga ggaggttaca ttccaggtcg
     6781 ggctcaacca atacctggtt gggtcacagc taccatgcga gcccgaaccg gatgtagcag
     6841 tgctcacttc catgctcacc gacccctccc acatcacagc agaaacggct aagcgtaggt
     6901 tggccagggg gtctcccccc tccttggcca gctcttcagc tagccagttg tctgcgcctt
     6961 ccttgaaggc gacatgcact acccaccatg tctctccgga cgctgacctc atcgaggcca
     7021 acctcctgtg gcggcaggag atgggcggga acatcacccg cgtggagtcg gagaacaagg
     7081 tggtagtcct ggactctttc gacccgcttc gagcggagga ggatgagagg gaagtatccg
     7141 ttccggcgga gatcctgcgg aaatccaaga agttccccgc agcgatgccc atctgggcgc
     7201 gcccggatta caaccctcca ctgttagagt cctggaagga cccggactac gtccctccgg
     7261 tggtgcacgg gtgcccgttg ccacctatca aggcccctcc aataccacct ccacggagaa
     7321 agaggacggt tgtcctaaca gagtcctccg tgtcttctgc cttagcggag ctcgctacta
     7381 agaccttcgg cagctccgaa tcatcggccg tcgacagcgg cacggcgacc gcccttcctg
     7441 accaggcctc cgacgacggt gacaaaggat ccgacgttga gtcgtactcc tccatgcccc
     7501 cccttgaggg ggaaccgggg gaccccgatc tcagtgacgg gtcttggtct accgtgagcg
     7561 aggaagctag tgaggatgtc gtctgctgct caatgtccta cacatggaca ggcgccttga
     7621 tcacgccatg cgctgcggag gaaagcaagc tgcccatcaa cgcgttgagc aactctttgc
     7681 tgcgccacca taacatggtt tatgccacaa catctcgcag cgcaggcctg cggcagaaga
     7741 aggtcacctt tgacagactg caagtcctgg acgaccacta ccgggacgtg ctcaaggaga
     7801 tgaaggcgaa ggcgtccaca gttaaggcta aactcctatc cgtagaggaa gcctgcaagc
     7861 tgacgccccc acattcggcc aaatccaagt ttggctatgg ggcaaaggac gtccggaacc
     7921 tatccagcaa ggccgttaac cacatccact ccgtgtggaa ggacttgctg gaagacactg
     7981 tgacaccaat tgacaccacc atcatggcaa aaaatgaggt tttctgtgtc caaccagaga
     8041 aaggaggccg taagccagcc cgccttatcg tattcccaga tctgggagtc cgtgtatgcg
     8101 agaagatggc cctctatgat gtggtctcca cccttcctca ggtcgtgatg ggctcctcat
     8161 acggattcca gtactctcct gggcagcgag tcgagttcct ggtgaatacc tggaaatcaa
     8221 agaaaaaccc catgggcttt tcatatgaca ctcgctgttt cgactcaacg gtcaccgaga
     8281 acgacatccg tgttgaggag tcaatttacc aatgttgtga cttggccccc gaagccagac
     8341 aggccataaa atcgctcaca gagcggcttt atatcggggg tcctctgact aattcaaaag
     8401 ggcagaactg cggttatcgc cggtgccgcg cgagcggcgt gctgacgact agctgcggta
     8461 acaccctcac atgttacttg aaggcctctg cagcctgtcg agctgcgaag ctccaggact
     8521 gcacgatgct cgtgaacgga gacgacctcg tcgttatctg tgaaagcgcg ggaacccaag
     8581 aggacgcggc gagcctacga gtcttcacgg aggctatgac taggtactcc gccccccccg
     8641 gggacccgcc ccaaccagaa tacgacttgg agctgataac atcatgttcc tccaatgtgt
     8701 cggtcgccca cgatgcatca ggcaaaaggg tgtactacct cacccgtgat cccaccaccc
     8761 ccctagcacg ggctgcgtgg gagacagcta gacacactcc agttaactcc tggctaggca
     8821 acattattat gtatgcgccc actttgtggg caaggatgat tctgatgact cacttcttct
     8881 ccatccttct agcgcaggag caacttgaaa aagccctgga ctgccagatc tacggggcct
     8941 gttactccat tgagccactt gacctacctc agatcattga acgactccat ggccttagcg
     9001 cattttcact ccatagttac tctccaggtg agatcaatag ggtggcttca tgcctcagga
     9061 aacttggggt accacccttg cgagtctgga gacatcgggc caggagcgtc cgcgctaggc
     9121 tactgtccca gggagggagg gccgccactt gtggcaaata cctcttcaac tgggcagtaa
     9181 aaaccaaact taaactcact ccaatcccgg ctgcgtcccg gctggacttg tccggctggt
     9241 tcgttgctgg ttacagcggg ggagacatat atcacagcct gtctcgtgcc cgaccccgtt
     9301 ggttcatgct gtgcctactc ctactttctg taggggtagg catctacctg ctccccaacc
     9361 gatgaacggg gagataaaca ctccaggcca ataggccatc cccctttttt tttttt
//
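
The coordinates in the FEATURES table can be sanity-checked with a few lines of code. The following is a minimal sketch, not part of the GenBank entry: the names CDS, MAT_PEPTIDES, span_length and check_features are hypothetical, and the numbers are copied by hand from the record above. It assumes that the three bases between the CDS start (333) and the first mat_peptide (336) are the initiator codon, and that the three bases after the last mat_peptide (9363..9365) are the stop codon.

```python
# Coordinates copied from the FEATURES table (1-based, inclusive).
CDS = (333, 9365)
MAT_PEPTIDES = [
    ("core protein", 336, 677),
    ("matrix protein", 678, 905),
    ("envelope protein", 906, 1499),
    ("NS1", 1500, 2519),
    ("NS2", 2520, 3350),
    ("NS3", 3351, 5177),
    ("NS4a", 5178, 5918),
    ("NS4b", 5919, 6371),
    ("unlabeled non-structural protein", 6372, 9362),
]
# Values from the BASE COUNT and LOCUS lines.
BASE_COUNT = {"a": 1905, "c": 2825, "g": 2679, "t": 2007}
RECORD_LENGTH = 9416


def span_length(start, end):
    """Length in bases of an inclusive 1-based span."""
    return end - start + 1


def check_features(cds, peptides):
    """Check that the peptides tile the CDS in frame.

    Returns a list of (name, codon_count) pairs. Raises ValueError if a
    peptide does not start right after the previous one, if its length
    is not a multiple of three, or if the last peptide does not end
    exactly one codon before the CDS end.
    """
    result = []
    expected_start = cds[0] + 3  # assumed initiator codon at 333..335
    for name, start, end in peptides:
        length = span_length(start, end)
        if start != expected_start:
            raise ValueError(f"{name}: starts at {start}, expected {expected_start}")
        if length % 3 != 0:
            raise ValueError(f"{name}: length {length} is not a multiple of 3")
        result.append((name, length // 3))
        expected_start = end + 1
    if expected_start + 3 != cds[1] + 1:  # assumed stop codon at the CDS end
        raise ValueError("last peptide does not end one codon before the CDS end")
    return result


if __name__ == "__main__":
    for name, codons in check_features(CDS, MAT_PEPTIDES):
        print(f"{name:35s} {codons:5d} codons")
    print("base count matches record length:",
          sum(BASE_COUNT.values()) == RECORD_LENGTH)
```

Under these assumptions the nine mature peptides account for every codon of the ORF except the initiator and the stop, and the four base counts sum to the 9416 bp given on the LOCUS line.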