GenBank-Updates@genbank.bio.net (05/29/91)
LOCUS MVEMVEV5 5436 bp ss-mRNA VRL 28-MAY-1991
DEFINITION Murray Valley Encephalitis virus genome 5' region (5.4 kb) with
genes C-prM(M)-E-NS1-ns2a-ns2b-NS3 (N-term)
ACCESSION X03467
KEYWORDS capsid protein; envelope glycoprotein; glycoprotein;
membrane protein.
SOURCE Murray Valley encephalitis virus RNA.
ORGANISM Murray Valley encephalitis virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Flaviviridae; Flavivirus (arbovirus group B).
REFERENCE 1 (bases 1 to 5436)
AUTHORS Dalgarno,L., Trent,D.W., Strauss,J.H. and Rice,C.M.
TITLE Partial nucleotide sequence of the Murray Valley encephalitis virus
genome - Comparison of the encoded polypeptides with yellow fever
virus structural and non-structural proteins
JOURNAL J. Mol. Biol. 187, 309-323 (1986)
STANDARD full automatic
COMMENT SWISS-PROT; P05769; POLG$MVEV.
Data kindly reviewed (01-AUG-1986) by Strauss J.H.
From EMBL entry FLMVEV5; dated 06-JUL-1989.
FEATURES Location/Qualifiers
CDS 98..472
/note="put. capsid protein"
/codon_start=98
CDS 473..973
/note="put. membrane protein precursor (formerly GP19)"
/codon_start=473
misc_feature 515..517
/note="pot. N-linked glycosylation site"
CDS 749..973
/note="put. mature membrane protein"
/codon_start=749
CDS 974..2476
/note="put. envelope protein"
/codon_start=974
misc_feature 1433..1435
/note="pot. N-linked glycosylation site"
CDS 2477..>2476
/note="put. NS1 protein (formerly gp44 or NV3)"
/codon_start=2477
misc_feature 2864..2866
/note="pot. N-linked glycosylation site"
misc_feature 3095..3097
/note="pot. N-linked glycosylation site"
CDS 3719..4213
/note="pot. ns2a protein"
/codon_start=3719
CDS 4214..4606
/note="pot. ns2b protein"
/codon_start=4214
CDS 4607..>5436
/note="put. NS3 protein (formerly P71 or NV4) (5436 is 2nd
base in codon)"
/codon_start=4607
BASE COUNT 1525 a 1156 c 1457 g 1294 t 4 others
ORIGIN
1 nnnnacgttc atctgcgtga gcttccgatc tcagtattgt ttggaaggat cattgattaa
61 cgcggtttga acagtttttt ggagcttttg atttcaaatg tctaaaaaac caggaggacc
121 cgggaagccc cgggtcgtca atatgctaaa acgcggcata ccccgcgtat tcccactagt
181 gggagtgaag agggtagtaa tgaacttgct agatggcaga gggccaatac ggtttgtgtt
241 ggctctctta gctttcttca ggtttacagc acttgccccg accaaggcct tgatgaggcg
301 ctggaagagc gtgaacaaga caacggccat gaaacatctg accagtttta agaaagaatt
361 aggaacactg attgatgtgg tgaacaaaag gggcaaaaaa caaaagaaaa gaggtggcag
421 tgaaacatcc gtgcttatgg tcattttcat gctgattgga tttgccgctg ccttaaagct
481 ttccaccttc cagggcaaga taatgatgac tgtgaacgct acggacattg ctgatgtgat
541 cgccattcca accccgaagg gacccaatca atgctggatt cgagccattg acattggatt
601 tatgtgtgat gacaccatca cttatgaatg cccgaaattg gaaagtggaa atgaccctga
661 agacattgac tgctggtgtg acaaacaagc tgtgtacgta aactatggaa ggtgcacacg
721 tgctcgccat tcaaagcgca gtcgtcgttc catcacagtg cagactcatg gtgaaagcac
781 tttggtcaac aaaaaggatg cctggctgga ttccacgaag gccacgcgtt atctcaccaa
841 aacagagaac tggattataa gaaatcctgg ttacgcgctg gtggccgttg tccttggctg
901 gatgctgggc agcaacactg gacaaaaagt tatttttaca gtgcttttgc tcctcgttgc
961 tcctgcctac agttttaact gtctgggaat gagcagccgt gatttcattg aaggtgcttc
1021 aggagctaca tgggtcgatt tggtgctgga gggcgacagt tgcatcacca tcatggccgc
1081 tgacaaaccc acccttgaca taagaatgat gaacattgaa gccaccaatc ttgcactggt
1141 tagaaattac tgctatgcag ctactgtgtc agacgtttct acggtgtcaa actgtcctac
1201 tacaggggag tcacacaaca cgaagcgggc agatcacaat tacttgtgca aacgaggtgt
1261 gaccgacaga ggctggggta atggatgtgg cttgtttggt aaggggagca ttgacacatg
1321 cgcaaagttc acctgctcta actcagctgc ggggagactt atcttacctg aggacatcaa
1381 atatgaagtt ggggtttttg ttcacggatc aacggactca accagtcatg gaaattattc
1441 tacccaaatt ggagctaacc aagcagtcag gttcaccatt tcaccaaacg ctccagccat
1501 cacagcaaag atgggcgact atggagaagt cactgtggag tgtgaaccga ggagtggact
1561 gaatacagag gcctactacg tcatgaccat tggaacgaaa cactttctag tgcatcgtga
1621 gtggttcaat gatttgctct tgccatggac atcacctgca agcacggaat ggaggaatag
1681 agaaattctc gtggagtttg aagagccaca tgccaccaaa caatcagtgg ttgccttggg
1741 ttcacaggaa ggagctttgc accaagctct ggctggagcc ataccagtcg agttttcgag
1801 cagcacactt aaactcactt caggacacct taagtgtcgc gtgaaaatgg agaaattgaa
1861 actgaaagga accacttatg ggatgtgcac agaaaaattt actttctcaa agaatccagc
1921 cgacaccggt catggcacgg tagttctaga actgcagtac accgggagtg atggaccatg
1981 caaaattcca atatcctctg tagcaagtct caatgacatg acgcctgtcg ggagaatggt
2041 gacagctaat ccatatgtag cttcatcaac tgccaatgct aaagttctgg tggagattga
2101 accacccttc ggagactcat acattgtggt aggcagggga gacaagcaga tcaatcacca
2161 ctggcataag gagggtagtt caattggcaa agccttcagc acaaccttga agggagcaca
2221 gagattggca gctcttggag acacggcgtg ggactttgga tcagtaggcg gagtcttcaa
2281 ttcaatcgga aaggcagtac accaagtctt tggaggagca tttagaaccc tctttggagg
2341 aatgtcatgg atcagccaag gtctgctggg ggcgttactg ctatggatgg gggtcaatgc
2401 tagagataaa tcaattgctt tggctttcct agcaacagga ggcgttttgt tgttcctggc
2461 cacaaatgtc catgctgaca ctggttgcgc gattgacatc accaggaggg agctcaagtg
2521 tggcagcgga atattcatac acaatgatgt tgaagcctgg attgaccgct acaagtacct
2581 gccagagacc cccaagcaac tggctaaagt ggttgaaaac gctcacaaga gcggaatatg
2641 tgggatacgg tcagtgaata gatttgaaca tcaaatgtgg gaatctgtgc gtgatgaact
2701 caatgcttta ctcaaggaaa atgccattga tttgagtgtt gtcgtggaaa aacagaaagg
2761 catgtacaga gcagcaccca ataggctgag actcactgtg gaagaacttg atataggctg
2821 gaaggcctgg ggtaagagtt tgctttttgc ggcggaattg gccaattcaa cgtttgtggt
2881 tgatggacct gaaacagctg aatgtcctaa ttcaaagagg gcatggaaca gctttgaaat
2941 tgaagacttt ggatttggca taacatccac cagggtttgg cttaaactca gagaggaaaa
3001 tacctcggag tgtgacagca ccattattgg gaccgcagtt aagggcaacc atgcagtaca
3061 cagtgacctt tcctactgga ttgagagtgg actcaatggg acatggaaac ttgagagagc
3121 catttttgga gaggtaaaat cctgcacctg gcccgagaca cacacactat ggggtgatgc
3181 agtggaagaa acggagttga taatcccagt gactctcgcc ggcccacgca gcaagcacaa
3241 cagaagagaa ggatacaagg ttcaagttca aggtccgtgg gatgaagaag acataaaatt
3301 ggactttgac tactgcccag gaacaaccgt cacagtgagt gaacattgtg gaaaacgagg
3361 tccctcagta cgcaccacaa ctgacagcgg gaaactcgtc acggactggt gttgtaggag
3421 ctgcacgctt cctcctttaa gatttaccac ggccagtgga tgttggtatg gaatggagat
3481 aagacctatg aagcatgatg agtccactct agttaaatca agggttcaag catttaatgg
3541 agatatgatt gatccttttc agttaggcct tctggtgatg tttctggcca cccaggaggt
3601 cttgaggaag aggtggacgg ccagacttac tctgccagca gcggttgggg ctctgctagt
3661 cctcctcctt gggggcatta cctacactga tctagtgcgg tatctcatat tggtgggttc
3721 agcatttgca gaatcaaaca acggaggtga tgtcattcat ttggcactca ttgctgtatt
3781 caaggtacag cccgcttttc ttgttgccag cttgacacgc agtagatgga ctaatcaaga
3841 gaatctcgtc ctggtcttgg gagcggcctt ctttcagatg gcagcttcag acctggagtt
3901 gacaattcca ggtttgttga actcagctgc tacagcttgg atggtgttgc gagcaatggc
3961 ttttccgtca acctcagcaa tagccatgcc catgctagca atgcttgctc cgggaatgag
4021 aatgcttcac cttgacacat ataggatagt gctcttgctg attggcattt gcagcttgct
4081 gaatgagaga aggaggtctg tggagaaaaa gaaaggagct gttttaattg gtctagccct
4141 gacttcgact ggatattttt caccaacaat catggcagca ggccttatga tatgcaaccc
4201 taacaagaag agaggatggc ccgccacaga ggtgttaaca gcagtgggac tgatgttcgc
4261 cattgttgga ggactcgctg agctagacat tgattctatg tcagtccctt ttaccatagc
4321 tggcctcatg ttagtgtcct atgttatctc gggaaaagcc acagacatgt ggcttgaaag
4381 agcagcagat gtgtcatggg aagcgggggc agcaataaca ggtaccagcg aaagactgga
4441 tgtccaattg gacgatgatg gagactttca tctgctcaat gaccctggag ttccatggaa
4501 aatctgggtt ttaagaatga catgcttaag cgtagctgcc atcactccat gggccatttt
4561 gccatccgcg tttgggtatt ggctgactct gaagtacaca aaacgagggg gtgttttttg
4621 ggatacgcct tccccaaaag tgtatcccaa gggggatacg acgccaggag tctaccgaat
4681 aatggccaga ggtatcctgg gaaggtacca agctggggta ggagtcatgc atgagggtgt
4741 gttccacacg ctgtggcaca caactagagg agctgccata atgagtggtg aaggaagatt
4801 gacaccatac tggggcaatg tgaaagagga cagagtgacc tatggtggcc catggaaatt
4861 agatcaaaaa tggaatggag tggatgacgt ccagatgata gtggtggaac caggaaagcc
4921 agcaataaac gtgcagacca agcccgggat ttttaagaca gcacacggag aaattggagc
4981 cgtgagcttg gattacccga ttggaacttc aggatcccca atagtcaata gcaatggaga
5041 aatcattggc ctttatggaa atggagtgat actaggtaat ggggcctacg tcagtgctat
5101 tgttcaagga gaaagagtag aggaaccagt gccagaggcc tacaatcctg agatgctaaa
5161 gaaaaggcaa ctaaccgtgc tggacctgca cccaggtgct gggaaaacac gacgcatact
5221 cccccaaata attaaagatg ccatccaaaa aagactacga acagctgttc ttgcaccaac
5281 gagggtggtt gcagcagaaa tggcagaagc tttgaggggc cttccagtta ggtatttgac
5341 tccagctgta caaagagaac acagtggaaa cgagatagtg gatgtgatgt gccatgcgac
5401 actaacacat cgattgatgt cgccgctaag aggccc
//