GenBank-Updates@genbank.bio.net (05/29/91)
LOCUS EEEEEE26S 4259 bp ss-mRNA VRL 28-MAY-1991
DEFINITION Eastern equine encephalomyelitis virus (EEE) 26S mRNA genomic reg.
ACCESSION X05816
KEYWORDS E1 protein; E2 protein; E3 protein; capsid protein.
SOURCE Eastern equine encephalomyelitis virus RNA.
ORGANISM Eastern equine encephalomyelitis virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Togaviridae; Alphaviridae.
REFERENCE 1 (bases 1 to 4259)
AUTHORS Chang,G.J. and Trent,D.W.
TITLE Nucleotide sequence of the genome region encoding the 26S mRNA of
Eastern equine encephalomyelitis virus and the deduced amino acid
sequence of the viral structural proteins
JOURNAL J. Gen. Virol. 68, 2129-2142 (1987)
STANDARD full automatic
COMMENT SWISS-PROT; P08768; POLS$EEEV.
*source: strain=82V-2137; clone=pEE-72 and pEE-14;
From EMBL entry TOEEE26S; dated 02-APR-1988.
FEATURES Location/Qualifiers
misc_feature <1..>0
/note="ns-P4 region (some stop codons found in the 26S
junction region)"
precursor_RNA 121..4259
/note="put. primary transcript (26S mRNA)"
CDS 179..3895
/note="common polyprotein precursor (AA 1-1239)"
/codon_start=179
CDS 179..955
/note="capsid protein (AA 1-259; 259 AA)"
/codon_start=179
CDS 956..1144
/note="E3 protein (AA 260-322; 63 AA)"
/codon_start=956
misc_feature 959..1006
/note="pos.transmembrane domain"
misc_feature 986..994
/note="N-linked glycosylation site"
CDS 1145..2404
/note="E2 protein (AA 323-742; 420 AA)"
/codon_start=1145
misc_feature 2048..2056
/note="N-linked glycosylation site"
misc_feature 2087..2095
/note="N-linked glycosylation site"
misc_feature 2228..2280
/note="pos.transmembrane domain"
misc_feature 2357..2389
/note="pos.transmembrane domain"
CDS 2405..2572
/note="6k protein (AA 743-798; 56 AA)"
/codon_start=2405
misc_feature 2506..2572
/note="pos.transmembrane domain"
CDS 2573..3895
/note="E1 protein (AA 799-1239; 441 AA)"
/codon_start=2573
misc_feature 2972..2980
/note="N-linked glycosylation site"
misc_feature 3809..3889
/note="pos.transmembrane domain"
polyA_site 4259..4259
/note="polyA site"
BASE COUNT 1194 a 1088 c 982 g 995 t
ORIGIN
1 atcacgctac gaggtgaact acgtgtcact aatcatcaca gcgttgacta cattagcatc
61 ttcagttagc aactttaaac acataagagg tcaccccata accctctacg gctgacctaa
121 ataggttgtg cattagtacc taacctattt atattatatt gctatctaaa tatcagagat
181 gttcccatac cctacactta actacccgcc tatggcgccg attaacccga tggcctaccg
241 ggatcctaat ccgcctaggc aggtggcgcc ctttaggcca ccacttgcag ctcaaattga
301 ggacctgaga cgttccattg ctaacctgac tttgaaacaa cgagcaccta accctccagc
361 aggaccgccc gccaaacgca agaagcctgc gccaagccta agcctggaga cgaaaaagaa
421 gcgaccacca ccacctgcca agaaacaaaa acgtaaacct aaaccaggca aacgacagcg
481 aatgtgtatg aagctagagt cagataaaac gtttccgatc atgttgaacg gacaggtgaa
541 tggttacgcg tgcgtcgtgg gtggacgagt gtttaaaccg ctgcacgtag aaggcagaat
601 agacaatgag caactggccg ctatcaagct gaagaaggcc agcatatatg accttgagta
661 cggtgatgtg ccacaatgca tgaaatcaga taccctccag tacaccagtg acaagcctcc
721 tggcttttat aactggcatc atggagctgt gcagtatgag aacaacaggt tcaccgtacc
781 acgaggggtc ggtggaaagg gcgacagcgg gagacctatt cttgacaaca aaggtagagt
841 cgtcgcaatt gtccagggtg gagtcaacga aggatccagg acggctctat cagtggtgac
901 atggaaccaa aagggggtta cagtcaaaga tacaccagag gggtcagagc catggtcgct
961 tgccactgtc atgtgcgtcc tggccaatat cacgtttcca tgtgatcaac caccctgcat
1021 gccatgctgt tatgaaaaga atccacacga aacacttacc atgctggaac agaattacga
1081 cagccgagcc tatgatcagc tgctcgatgc cgctgtgaaa tgtaatgcta ggagaaccag
1141 gagagatttg gacactcatt tcacccagta taagttggca cgcccgtata ttgctgattg
1201 ccctaactgt gggcatagtc ggtgcgacag ccctatagct atagaagaag tcagagggga
1261 tgcgcatgca ggagtcatcc gcatccagac atcagctatg ttcggtctga agaggcatgg
1321 agtcgatttg gcctacatga gtttcatgaa cggcaaaacg cagaaatcaa taaagatcga
1381 caacctgcat gtgcgcacct cagccccttg ttccctcgtg tcgcaccacg gctattacat
1441 cttggctcaa tgcccaccag gggacacggt tacagttggg tttcacgacg ggcctaaccg
1501 ccatacgtgc agacttgccc ataaggtaga attcaggcca gtgggtagag agaaataccg
1561 tcacccacct gaacatggag ttgaattacc gtgtaaccgt tacactcaca agcgtgcaga
1621 ccaaggacac tatgttgaga tgcatcaacc agggctagtt ggcgaccact ctctccttag
1681 catccacagt gccaaggtga aaattacggt accgagcggc gcccaagtga aatactactg
1741 caagtgtcca gatgtacgag agggaattac cagcagcgac catacaacca cctgcacgga
1801 tgtcaaacaa tgcagggctt acctgattga caacaaaaaa tgggtgtaca actctggaag
1861 actgcctcga ggagagggcg acacttttaa aggaaaactt catgtgccct ttgtgcctgt
1921 taaggccaag tgcatcgcca cgctggcacc ggagcctcta gttgagcaca aacaccgcac
1981 cctgatttta cacctgcacc cggaccatcc gaccttgctg acgaccagat cacttggaag
2041 tgatgcaaat ccaactcgac aatggattga gcgtccaaca actgtcaatt tcacagtcac
2101 cggagaaggg ttggagtata cctggggaaa ccatccacca aaaagagtat gggctcaaga
2161 gtcaggagaa gggaacccac atggatggcc gcacgttgtg gtagtctatt actacaacag
2221 atacccgtta accacaatta tcgggttatg cacctgtgtg gctatcatca tggtctcttg
2281 tgatcatccg tgtggctcct tttcaggact tcgcaatctt tgcataaccc cgtataaact
2341 agccccgaac gctcaagtcc caatactcct ggcgttactt tgctgcatta agccgacgag
2401 ggcagacgac accttgcaag tgctgaatta tctgtggaac aacaatcaaa actttttctg
2461 gatgcagacg cttatcccac ttgcagcgct tatcgtatgc atgcgcatgc tcgctgcctt
2521 attttgctgt gggccggctt ttttacttgt ctgcggcgct tgggccgcag cgtacgaaca
2581 cacagcagtg atgccgaaca aggtggggat cccgtacaaa gctttagtcg aacgcccagg
2641 ttatgcaccc gttcacctac agatacagct ggttaatacc aggataattc catcaactaa
2701 cctggagtac atcacctgca agtacaagac aaaagtgccg tctccagtag tgaaatgctg
2761 cggtgccact caatgtacct ccaaacccca tcctgactat cagtgtcagg tgtttacagg
2821 tgtttaccca ttcatgtggg gaggagccta ctgcttctgc gacaccgaaa acacccagat
2881 gagcgaggcg tatgtagagc gctcggaaga gtgctctatt gaccacgcaa aagcttataa
2941 agtacacaca ggcactgttc aggcaatggt gaacataact tatgggagcg tcacgtggag
3001 atctgcagat gtctacgtca atggtgaaac tcccgcgaaa ataggagatg ccaaactcat
3061 cataggtcca ctgtcatctg cgtggtcccc attcgataac aaggtggtgg tttatgggca
3121 tgaagtgtat aattacgact ttcctgagta cggcaccggc aaagcaggct cttttggaga
3181 cctgcaatca cgcacatcaa ccagcaacga tctgtacgca aacaccaact tgaagctaca
3241 acgaccccag gctggtatcg tgcacacacc tttcacccag gcgccctctg gcttcgaacg
3301 atggaaaagg gacaaagggg caccgttgaa cgacgtagcc ccgtttggct gttcgattgc
3361 cctggagccg ctccgtccag aaaattgtgc agtgggaagc atccctatat ctatagatat
3421 acccgatgcg gctttcacca gaatatctga aacaccgaca gtctcagacc tggaatgcaa
3481 aattacggag tgtacttatg cctccgattt cggtggtata gccacgttgc ctacaaatcc
3541 agtaaagcag gaaactgtcc aattcattgt ccatcaggtg ttgcagttat taaagagaat
3601 gacgtcaccc ttgctgagag cgggatcatt tacattccac ttctccactg caaacatcca
3661 tcctgctttt aagctgcagg tctgcactag tggcattacc tgcaaaggag attgcaagcc
3721 accgaaagat catatcgtcg attatccagc acaacatacc gaatccttta cgtcggcgat
3781 atccgccacc gcgtggtcgt ggctaaaagt gctggtagga ggaacatcag catttattgt
3841 tctggggctt attgctacag cagtggttgc cctagttctg ttcttccata gacattaaca
3901 tcctgtcaac cacataacac tacaggcagt gtataaggct gtcttactaa acactaaatt
3961 caccctagtt cgatgtactt ccgagctatg gtgacggtgg tgcataatgc cgcccgatgc
4021 agtgcataag gctgctatat taccaaatta taacactaag ggcatgcata atgcttggtc
4081 ctaagtaatt ttatacacac tttataatca ggcataattg ccgtatatac aattacacta
4141 caggtaatat accgcctctt ataaatacta caggcgaagc gcataatgct gccttttata
4201 tcaatttaca aaatcatatt aatttttctt ttatgttttt attttgtttt taatatttc
//