GenBank-Updates@genbank.bio.net (05/29/91)
LOCUS MVEMVEV5 5436 bp ss-mRNA VRL 28-MAY-1991 DEFINITION Murray Valley Encephalitis virus genome 5' region (5.4 kb) with genes C-prM(M)-E-NS1-ns2a-ns2b-NS3 (N-term) ACCESSION X03467 KEYWORDS capsid protein; envelope glycoprotein; glycoprotein; membrane protein. SOURCE Murray Valley encephalitis virus RNA. ORGANISM Murray Valley encephalitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Flaviviridae; Flavivirus (arbovirus group B). REFERENCE 1 (bases 1 to 5436) AUTHORS Dalgarno,L., Trent,D.W., Strauss,J.H. and Rice,C.M. TITLE Partial nucleotide sequence of the Murray Valley encephalitis virus genome - Comparison of the encoded polypeptides with yellow fever virus structural and non-structural proteins JOURNAL J. Mol. Biol. 187, 309-323 (1986) STANDARD full automatic COMMENT SWISS-PROT; P05769; POLG$MVEV. Data kindly reviewed (01-AUG-1986) by Strauss J.H. From EMBL entry FLMVEV5; dated 06-JUL-1989. FEATURES Location/Qualifiers CDS 98..472 /note="put. capsid protein" /codon_start=98 CDS 473..973 /note="put. membrane protein precursor (formerly GP19)" /codon_start=473 misc_feature 515..517 /note="pot. N-linked glycosylation site" CDS 749..973 /note="put. mature membrane protein" /codon_start=749 CDS 974..2476 /note="put. envelope protein" /codon_start=974 misc_feature 1433..1435 /note="pot. N-linked glycosylation site" CDS 2477..>2476 /note="put. NS1 protein (formerly gp44 or NV3)" /codon_start=2477 misc_feature 2864..2866 /note="pot. N-linked glycosylation site" misc_feature 3095..3097 /note="pot. N-linked glycosylation site" CDS 3719..4213 /note="pot. ns2a protein" /codon_start=3719 CDS 4214..4606 /note="pot. ns2b protein" /codon_start=4214 CDS 4607..>5436 /note="put. NS3 protein (formerly P71 or NV4) (5436 is 2nd base in codon)" /codon_start=4607 BASE COUNT 1525 a 1156 c 1457 g 1294 t 4 others ORIGIN 1 nnnnacgttc atctgcgtga gcttccgatc tcagtattgt ttggaaggat cattgattaa 61 cgcggtttga acagtttttt ggagcttttg atttcaaatg tctaaaaaac caggaggacc 121 cgggaagccc cgggtcgtca atatgctaaa acgcggcata ccccgcgtat tcccactagt 181 gggagtgaag agggtagtaa tgaacttgct agatggcaga gggccaatac ggtttgtgtt 241 ggctctctta gctttcttca ggtttacagc acttgccccg accaaggcct tgatgaggcg 301 ctggaagagc gtgaacaaga caacggccat gaaacatctg accagtttta agaaagaatt 361 aggaacactg attgatgtgg tgaacaaaag gggcaaaaaa caaaagaaaa gaggtggcag 421 tgaaacatcc gtgcttatgg tcattttcat gctgattgga tttgccgctg ccttaaagct 481 ttccaccttc cagggcaaga taatgatgac tgtgaacgct acggacattg ctgatgtgat 541 cgccattcca accccgaagg gacccaatca atgctggatt cgagccattg acattggatt 601 tatgtgtgat gacaccatca cttatgaatg cccgaaattg gaaagtggaa atgaccctga 661 agacattgac tgctggtgtg acaaacaagc tgtgtacgta aactatggaa ggtgcacacg 721 tgctcgccat tcaaagcgca gtcgtcgttc catcacagtg cagactcatg gtgaaagcac 781 tttggtcaac aaaaaggatg cctggctgga ttccacgaag gccacgcgtt atctcaccaa 841 aacagagaac tggattataa gaaatcctgg ttacgcgctg gtggccgttg tccttggctg 901 gatgctgggc agcaacactg gacaaaaagt tatttttaca gtgcttttgc tcctcgttgc 961 tcctgcctac agttttaact gtctgggaat gagcagccgt gatttcattg aaggtgcttc 1021 aggagctaca tgggtcgatt tggtgctgga gggcgacagt tgcatcacca tcatggccgc 1081 tgacaaaccc acccttgaca taagaatgat gaacattgaa gccaccaatc ttgcactggt 1141 tagaaattac tgctatgcag ctactgtgtc agacgtttct acggtgtcaa actgtcctac 1201 tacaggggag tcacacaaca cgaagcgggc agatcacaat tacttgtgca aacgaggtgt 1261 gaccgacaga ggctggggta atggatgtgg cttgtttggt aaggggagca ttgacacatg 1321 cgcaaagttc acctgctcta actcagctgc ggggagactt atcttacctg aggacatcaa 1381 atatgaagtt ggggtttttg ttcacggatc aacggactca accagtcatg gaaattattc 1441 tacccaaatt ggagctaacc aagcagtcag gttcaccatt tcaccaaacg ctccagccat 1501 cacagcaaag atgggcgact atggagaagt cactgtggag tgtgaaccga ggagtggact 1561 gaatacagag gcctactacg tcatgaccat tggaacgaaa cactttctag tgcatcgtga 1621 gtggttcaat gatttgctct tgccatggac atcacctgca agcacggaat ggaggaatag 1681 agaaattctc gtggagtttg aagagccaca tgccaccaaa caatcagtgg ttgccttggg 1741 ttcacaggaa ggagctttgc accaagctct ggctggagcc ataccagtcg agttttcgag 1801 cagcacactt aaactcactt caggacacct taagtgtcgc gtgaaaatgg agaaattgaa 1861 actgaaagga accacttatg ggatgtgcac agaaaaattt actttctcaa agaatccagc 1921 cgacaccggt catggcacgg tagttctaga actgcagtac accgggagtg atggaccatg 1981 caaaattcca atatcctctg tagcaagtct caatgacatg acgcctgtcg ggagaatggt 2041 gacagctaat ccatatgtag cttcatcaac tgccaatgct aaagttctgg tggagattga 2101 accacccttc ggagactcat acattgtggt aggcagggga gacaagcaga tcaatcacca 2161 ctggcataag gagggtagtt caattggcaa agccttcagc acaaccttga agggagcaca 2221 gagattggca gctcttggag acacggcgtg ggactttgga tcagtaggcg gagtcttcaa 2281 ttcaatcgga aaggcagtac accaagtctt tggaggagca tttagaaccc tctttggagg 2341 aatgtcatgg atcagccaag gtctgctggg ggcgttactg ctatggatgg gggtcaatgc 2401 tagagataaa tcaattgctt tggctttcct agcaacagga ggcgttttgt tgttcctggc 2461 cacaaatgtc catgctgaca ctggttgcgc gattgacatc accaggaggg agctcaagtg 2521 tggcagcgga atattcatac acaatgatgt tgaagcctgg attgaccgct acaagtacct 2581 gccagagacc cccaagcaac tggctaaagt ggttgaaaac gctcacaaga gcggaatatg 2641 tgggatacgg tcagtgaata gatttgaaca tcaaatgtgg gaatctgtgc gtgatgaact 2701 caatgcttta ctcaaggaaa atgccattga tttgagtgtt gtcgtggaaa aacagaaagg 2761 catgtacaga gcagcaccca ataggctgag actcactgtg gaagaacttg atataggctg 2821 gaaggcctgg ggtaagagtt tgctttttgc ggcggaattg gccaattcaa cgtttgtggt 2881 tgatggacct gaaacagctg aatgtcctaa ttcaaagagg gcatggaaca gctttgaaat 2941 tgaagacttt ggatttggca taacatccac cagggtttgg cttaaactca gagaggaaaa 3001 tacctcggag tgtgacagca ccattattgg gaccgcagtt aagggcaacc atgcagtaca 3061 cagtgacctt tcctactgga ttgagagtgg actcaatggg acatggaaac ttgagagagc 3121 catttttgga gaggtaaaat cctgcacctg gcccgagaca cacacactat ggggtgatgc 3181 agtggaagaa acggagttga taatcccagt gactctcgcc ggcccacgca gcaagcacaa 3241 cagaagagaa ggatacaagg ttcaagttca aggtccgtgg gatgaagaag acataaaatt 3301 ggactttgac tactgcccag gaacaaccgt cacagtgagt gaacattgtg gaaaacgagg 3361 tccctcagta cgcaccacaa ctgacagcgg gaaactcgtc acggactggt gttgtaggag 3421 ctgcacgctt cctcctttaa gatttaccac ggccagtgga tgttggtatg gaatggagat 3481 aagacctatg aagcatgatg agtccactct agttaaatca agggttcaag catttaatgg 3541 agatatgatt gatccttttc agttaggcct tctggtgatg tttctggcca cccaggaggt 3601 cttgaggaag aggtggacgg ccagacttac tctgccagca gcggttgggg ctctgctagt 3661 cctcctcctt gggggcatta cctacactga tctagtgcgg tatctcatat tggtgggttc 3721 agcatttgca gaatcaaaca acggaggtga tgtcattcat ttggcactca ttgctgtatt 3781 caaggtacag cccgcttttc ttgttgccag cttgacacgc agtagatgga ctaatcaaga 3841 gaatctcgtc ctggtcttgg gagcggcctt ctttcagatg gcagcttcag acctggagtt 3901 gacaattcca ggtttgttga actcagctgc tacagcttgg atggtgttgc gagcaatggc 3961 ttttccgtca acctcagcaa tagccatgcc catgctagca atgcttgctc cgggaatgag 4021 aatgcttcac cttgacacat ataggatagt gctcttgctg attggcattt gcagcttgct 4081 gaatgagaga aggaggtctg tggagaaaaa gaaaggagct gttttaattg gtctagccct 4141 gacttcgact ggatattttt caccaacaat catggcagca ggccttatga tatgcaaccc 4201 taacaagaag agaggatggc ccgccacaga ggtgttaaca gcagtgggac tgatgttcgc 4261 cattgttgga ggactcgctg agctagacat tgattctatg tcagtccctt ttaccatagc 4321 tggcctcatg ttagtgtcct atgttatctc gggaaaagcc acagacatgt ggcttgaaag 4381 agcagcagat gtgtcatggg aagcgggggc agcaataaca ggtaccagcg aaagactgga 4441 tgtccaattg gacgatgatg gagactttca tctgctcaat gaccctggag ttccatggaa 4501 aatctgggtt ttaagaatga catgcttaag cgtagctgcc atcactccat gggccatttt 4561 gccatccgcg tttgggtatt ggctgactct gaagtacaca aaacgagggg gtgttttttg 4621 ggatacgcct tccccaaaag tgtatcccaa gggggatacg acgccaggag tctaccgaat 4681 aatggccaga ggtatcctgg gaaggtacca agctggggta ggagtcatgc atgagggtgt 4741 gttccacacg ctgtggcaca caactagagg agctgccata atgagtggtg aaggaagatt 4801 gacaccatac tggggcaatg tgaaagagga cagagtgacc tatggtggcc catggaaatt 4861 agatcaaaaa tggaatggag tggatgacgt ccagatgata gtggtggaac caggaaagcc 4921 agcaataaac gtgcagacca agcccgggat ttttaagaca gcacacggag aaattggagc 4981 cgtgagcttg gattacccga ttggaacttc aggatcccca atagtcaata gcaatggaga 5041 aatcattggc ctttatggaa atggagtgat actaggtaat ggggcctacg tcagtgctat 5101 tgttcaagga gaaagagtag aggaaccagt gccagaggcc tacaatcctg agatgctaaa 5161 gaaaaggcaa ctaaccgtgc tggacctgca cccaggtgct gggaaaacac gacgcatact 5221 cccccaaata attaaagatg ccatccaaaa aagactacga acagctgttc ttgcaccaac 5281 gagggtggtt gcagcagaaa tggcagaagc tttgaggggc cttccagtta ggtatttgac 5341 tccagctgta caaagagaac acagtggaaa cgagatagtg gatgtgatgt gccatgcgac 5401 actaacacat cgattgatgt cgccgctaag aggccc //