[bionet.molbio.genbank.updates] Murray Valley Encephalitis virus genome 5' region

GenBank-Updates@genbank.bio.net (05/29/91)

LOCUS       MVEMVEV5     5436 bp ss-mRNA            VRL       28-MAY-1991
DEFINITION  Murray Valley Encephalitis virus genome 5' region (5.4 kb) with
            genes C-prM(M)-E-NS1-ns2a-ns2b-NS3 (N-term)
ACCESSION   X03467
KEYWORDS    capsid protein; envelope glycoprotein; glycoprotein;
            membrane protein.
SOURCE      Murray Valley encephalitis virus RNA.
  ORGANISM  Murray Valley encephalitis virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
            Flaviviridae; Flavivirus (arbovirus group B).
REFERENCE   1  (bases 1 to 5436)
  AUTHORS   Dalgarno,L., Trent,D.W., Strauss,J.H. and Rice,C.M.
  TITLE     Partial nucleotide sequence of the Murray Valley encephalitis virus
            genome - Comparison of the encoded polypeptides with yellow fever
            virus structural and non-structural proteins
  JOURNAL   J. Mol. Biol. 187, 309-323 (1986)
  STANDARD  full automatic
COMMENT     SWISS-PROT; P05769; POLG$MVEV.
            
            Data kindly reviewed (01-AUG-1986) by Strauss J.H.
            
            From EMBL    entry FLMVEV5;  dated 06-JUL-1989.
FEATURES             Location/Qualifiers
     CDS             98..472
                     /note="put. capsid protein"
                     /codon_start=98
     CDS             473..973
                     /note="put. membrane protein precursor (formerly GP19)"
                     /codon_start=473
     misc_feature    515..517
                     /note="pot. N-linked glycosylation site"
     CDS             749..973
                     /note="put. mature membrane protein"
                     /codon_start=749
     CDS             974..2476
                     /note="put. envelope protein"
                     /codon_start=974
     misc_feature    1433..1435
                     /note="pot. N-linked glycosylation site"
     CDS             2477..>2476
                     /note="put. NS1 protein (formerly gp44 or NV3)"
                     /codon_start=2477
     misc_feature    2864..2866
                     /note="pot. N-linked glycosylation site"
     misc_feature    3095..3097
                     /note="pot. N-linked glycosylation site"
     CDS             3719..4213
                     /note="pot. ns2a protein"
                     /codon_start=3719
     CDS             4214..4606
                     /note="pot. ns2b protein"
                     /codon_start=4214
     CDS             4607..>5436
                     /note="put. NS3 protein (formerly P71 or NV4) (5436 is 2nd
                     base in codon)"
                     /codon_start=4607
BASE COUNT     1525 a   1156 c   1457 g   1294 t      4 others
ORIGIN
        1 nnnnacgttc atctgcgtga gcttccgatc tcagtattgt ttggaaggat cattgattaa
       61 cgcggtttga acagtttttt ggagcttttg atttcaaatg tctaaaaaac caggaggacc
      121 cgggaagccc cgggtcgtca atatgctaaa acgcggcata ccccgcgtat tcccactagt
      181 gggagtgaag agggtagtaa tgaacttgct agatggcaga gggccaatac ggtttgtgtt
      241 ggctctctta gctttcttca ggtttacagc acttgccccg accaaggcct tgatgaggcg
      301 ctggaagagc gtgaacaaga caacggccat gaaacatctg accagtttta agaaagaatt
      361 aggaacactg attgatgtgg tgaacaaaag gggcaaaaaa caaaagaaaa gaggtggcag
      421 tgaaacatcc gtgcttatgg tcattttcat gctgattgga tttgccgctg ccttaaagct
      481 ttccaccttc cagggcaaga taatgatgac tgtgaacgct acggacattg ctgatgtgat
      541 cgccattcca accccgaagg gacccaatca atgctggatt cgagccattg acattggatt
      601 tatgtgtgat gacaccatca cttatgaatg cccgaaattg gaaagtggaa atgaccctga
      661 agacattgac tgctggtgtg acaaacaagc tgtgtacgta aactatggaa ggtgcacacg
      721 tgctcgccat tcaaagcgca gtcgtcgttc catcacagtg cagactcatg gtgaaagcac
      781 tttggtcaac aaaaaggatg cctggctgga ttccacgaag gccacgcgtt atctcaccaa
      841 aacagagaac tggattataa gaaatcctgg ttacgcgctg gtggccgttg tccttggctg
      901 gatgctgggc agcaacactg gacaaaaagt tatttttaca gtgcttttgc tcctcgttgc
      961 tcctgcctac agttttaact gtctgggaat gagcagccgt gatttcattg aaggtgcttc
     1021 aggagctaca tgggtcgatt tggtgctgga gggcgacagt tgcatcacca tcatggccgc
     1081 tgacaaaccc acccttgaca taagaatgat gaacattgaa gccaccaatc ttgcactggt
     1141 tagaaattac tgctatgcag ctactgtgtc agacgtttct acggtgtcaa actgtcctac
     1201 tacaggggag tcacacaaca cgaagcgggc agatcacaat tacttgtgca aacgaggtgt
     1261 gaccgacaga ggctggggta atggatgtgg cttgtttggt aaggggagca ttgacacatg
     1321 cgcaaagttc acctgctcta actcagctgc ggggagactt atcttacctg aggacatcaa
     1381 atatgaagtt ggggtttttg ttcacggatc aacggactca accagtcatg gaaattattc
     1441 tacccaaatt ggagctaacc aagcagtcag gttcaccatt tcaccaaacg ctccagccat
     1501 cacagcaaag atgggcgact atggagaagt cactgtggag tgtgaaccga ggagtggact
     1561 gaatacagag gcctactacg tcatgaccat tggaacgaaa cactttctag tgcatcgtga
     1621 gtggttcaat gatttgctct tgccatggac atcacctgca agcacggaat ggaggaatag
     1681 agaaattctc gtggagtttg aagagccaca tgccaccaaa caatcagtgg ttgccttggg
     1741 ttcacaggaa ggagctttgc accaagctct ggctggagcc ataccagtcg agttttcgag
     1801 cagcacactt aaactcactt caggacacct taagtgtcgc gtgaaaatgg agaaattgaa
     1861 actgaaagga accacttatg ggatgtgcac agaaaaattt actttctcaa agaatccagc
     1921 cgacaccggt catggcacgg tagttctaga actgcagtac accgggagtg atggaccatg
     1981 caaaattcca atatcctctg tagcaagtct caatgacatg acgcctgtcg ggagaatggt
     2041 gacagctaat ccatatgtag cttcatcaac tgccaatgct aaagttctgg tggagattga
     2101 accacccttc ggagactcat acattgtggt aggcagggga gacaagcaga tcaatcacca
     2161 ctggcataag gagggtagtt caattggcaa agccttcagc acaaccttga agggagcaca
     2221 gagattggca gctcttggag acacggcgtg ggactttgga tcagtaggcg gagtcttcaa
     2281 ttcaatcgga aaggcagtac accaagtctt tggaggagca tttagaaccc tctttggagg
     2341 aatgtcatgg atcagccaag gtctgctggg ggcgttactg ctatggatgg gggtcaatgc
     2401 tagagataaa tcaattgctt tggctttcct agcaacagga ggcgttttgt tgttcctggc
     2461 cacaaatgtc catgctgaca ctggttgcgc gattgacatc accaggaggg agctcaagtg
     2521 tggcagcgga atattcatac acaatgatgt tgaagcctgg attgaccgct acaagtacct
     2581 gccagagacc cccaagcaac tggctaaagt ggttgaaaac gctcacaaga gcggaatatg
     2641 tgggatacgg tcagtgaata gatttgaaca tcaaatgtgg gaatctgtgc gtgatgaact
     2701 caatgcttta ctcaaggaaa atgccattga tttgagtgtt gtcgtggaaa aacagaaagg
     2761 catgtacaga gcagcaccca ataggctgag actcactgtg gaagaacttg atataggctg
     2821 gaaggcctgg ggtaagagtt tgctttttgc ggcggaattg gccaattcaa cgtttgtggt
     2881 tgatggacct gaaacagctg aatgtcctaa ttcaaagagg gcatggaaca gctttgaaat
     2941 tgaagacttt ggatttggca taacatccac cagggtttgg cttaaactca gagaggaaaa
     3001 tacctcggag tgtgacagca ccattattgg gaccgcagtt aagggcaacc atgcagtaca
     3061 cagtgacctt tcctactgga ttgagagtgg actcaatggg acatggaaac ttgagagagc
     3121 catttttgga gaggtaaaat cctgcacctg gcccgagaca cacacactat ggggtgatgc
     3181 agtggaagaa acggagttga taatcccagt gactctcgcc ggcccacgca gcaagcacaa
     3241 cagaagagaa ggatacaagg ttcaagttca aggtccgtgg gatgaagaag acataaaatt
     3301 ggactttgac tactgcccag gaacaaccgt cacagtgagt gaacattgtg gaaaacgagg
     3361 tccctcagta cgcaccacaa ctgacagcgg gaaactcgtc acggactggt gttgtaggag
     3421 ctgcacgctt cctcctttaa gatttaccac ggccagtgga tgttggtatg gaatggagat
     3481 aagacctatg aagcatgatg agtccactct agttaaatca agggttcaag catttaatgg
     3541 agatatgatt gatccttttc agttaggcct tctggtgatg tttctggcca cccaggaggt
     3601 cttgaggaag aggtggacgg ccagacttac tctgccagca gcggttgggg ctctgctagt
     3661 cctcctcctt gggggcatta cctacactga tctagtgcgg tatctcatat tggtgggttc
     3721 agcatttgca gaatcaaaca acggaggtga tgtcattcat ttggcactca ttgctgtatt
     3781 caaggtacag cccgcttttc ttgttgccag cttgacacgc agtagatgga ctaatcaaga
     3841 gaatctcgtc ctggtcttgg gagcggcctt ctttcagatg gcagcttcag acctggagtt
     3901 gacaattcca ggtttgttga actcagctgc tacagcttgg atggtgttgc gagcaatggc
     3961 ttttccgtca acctcagcaa tagccatgcc catgctagca atgcttgctc cgggaatgag
     4021 aatgcttcac cttgacacat ataggatagt gctcttgctg attggcattt gcagcttgct
     4081 gaatgagaga aggaggtctg tggagaaaaa gaaaggagct gttttaattg gtctagccct
     4141 gacttcgact ggatattttt caccaacaat catggcagca ggccttatga tatgcaaccc
     4201 taacaagaag agaggatggc ccgccacaga ggtgttaaca gcagtgggac tgatgttcgc
     4261 cattgttgga ggactcgctg agctagacat tgattctatg tcagtccctt ttaccatagc
     4321 tggcctcatg ttagtgtcct atgttatctc gggaaaagcc acagacatgt ggcttgaaag
     4381 agcagcagat gtgtcatggg aagcgggggc agcaataaca ggtaccagcg aaagactgga
     4441 tgtccaattg gacgatgatg gagactttca tctgctcaat gaccctggag ttccatggaa
     4501 aatctgggtt ttaagaatga catgcttaag cgtagctgcc atcactccat gggccatttt
     4561 gccatccgcg tttgggtatt ggctgactct gaagtacaca aaacgagggg gtgttttttg
     4621 ggatacgcct tccccaaaag tgtatcccaa gggggatacg acgccaggag tctaccgaat
     4681 aatggccaga ggtatcctgg gaaggtacca agctggggta ggagtcatgc atgagggtgt
     4741 gttccacacg ctgtggcaca caactagagg agctgccata atgagtggtg aaggaagatt
     4801 gacaccatac tggggcaatg tgaaagagga cagagtgacc tatggtggcc catggaaatt
     4861 agatcaaaaa tggaatggag tggatgacgt ccagatgata gtggtggaac caggaaagcc
     4921 agcaataaac gtgcagacca agcccgggat ttttaagaca gcacacggag aaattggagc
     4981 cgtgagcttg gattacccga ttggaacttc aggatcccca atagtcaata gcaatggaga
     5041 aatcattggc ctttatggaa atggagtgat actaggtaat ggggcctacg tcagtgctat
     5101 tgttcaagga gaaagagtag aggaaccagt gccagaggcc tacaatcctg agatgctaaa
     5161 gaaaaggcaa ctaaccgtgc tggacctgca cccaggtgct gggaaaacac gacgcatact
     5221 cccccaaata attaaagatg ccatccaaaa aagactacga acagctgttc ttgcaccaac
     5281 gagggtggtt gcagcagaaa tggcagaagc tttgaggggc cttccagtta ggtatttgac
     5341 tccagctgta caaagagaac acagtggaaa cgagatagtg gatgtgatgt gccatgcgac
     5401 actaacacat cgattgatgt cgccgctaag aggccc
//