[bionet.molbio.genbank.updates] Eastern equine encephalomyelitis Virus

GenBank-Updates@genbank.bio.net (05/29/91)

LOCUS       EEEEEE26S    4259 bp ss-mRNA            VRL       28-MAY-1991
DEFINITION  Eastern equine encephalomyelitis Virus(EEE) 26S mRNA genomic reg.
ACCESSION   X05816
KEYWORDS    E1 protein; E2 protein; E3 protein; capsid protein.
SOURCE      Eastern equine encephalomyelitis virus RNA.
  ORGANISM  Eastern equine encephalomyelitis virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
            Togaviridae; Alphaviridae.
REFERENCE   1  (bases 1 to 4259)
  AUTHORS   Chang,G.J. and Trent,D.W.
  TITLE     Nucleotide sequence of the genome region encoding the 26S mRNA of
            Eastern equine encephalomyelitis virus and the deduced amino acid
            sequence of the viral structural proteins
  JOURNAL   J. Gen. Virol. 68, 2129-2142 (1987)
  STANDARD  full automatic
COMMENT     SWISS-PROT; P08768; POLS$EEEV.
            
            *source: strain=82V-2137; clone=pEE-72 and pEE-14;
            
            From EMBL    entry TOEEE26S;  dated 02-APR-1988.
FEATURES             Location/Qualifiers
     misc_feature    <1..>0
                     /note="ns-P4 region (some stop codons found in the 26S
                     junction region)"
     precursor_RNA   121..4259
                     /note="put. primary transcript (26S mRNA)"
     CDS             179..3895
                     /note="common polyprotein precursor (AA 1-1239)"
                     /codon_start=179
     CDS             179..955
                     /note="capsid protein (AA 1-259; 259 AA)"
                     /codon_start=179
     CDS             956..1144
                     /note="E3 protein (AA 260-322;63 AA)"
                     /codon_start=956
     misc_feature    959..1006
                     /note="pos.transmembrane domain"
     misc_feature    986..994
                     /note="N-linked glycosylation site"
     CDS             1145..2404
                     /note="E2 protein (AA 323-742; 420 AA)"
                     /codon_start=1145
     misc_feature    2048..2056
                     /note="N-linked glycosylation site"
     misc_feature    2087..2095
                     /note="N-linked glycosylation site"
     misc_feature    2228..2280
                     /note="pos.transmembrane domain"
     misc_feature    2357..2389
                     /note="pos.transmembrane domain"
     CDS             2405..2572
                     /note="6k protein (AA 743-798; 56 AA)"
                     /codon_start=2405
     misc_feature    2506..2572
                     /note="pos.transmembrane domain"
     CDS             2573..3895
                     /note="E1 protein (AA 799-1239; 441 AA)"
                     /codon_start=2573
     misc_feature    2972..2980
                     /note="N-linked glycosylation site"
     misc_feature    3809..3889
                     /note="pos.transmembrane domain"
     polyA_site      4259..4259
                     /note="polyA site"
BASE COUNT     1194 a   1088 c    982 g    995 t
ORIGIN
        1 atcacgctac gaggtgaact acgtgtcact aatcatcaca gcgttgacta cattagcatc
       61 ttcagttagc aactttaaac acataagagg tcaccccata accctctacg gctgacctaa
      121 ataggttgtg cattagtacc taacctattt atattatatt gctatctaaa tatcagagat
      181 gttcccatac cctacactta actacccgcc tatggcgccg attaacccga tggcctaccg
      241 ggatcctaat ccgcctaggc aggtggcgcc ctttaggcca ccacttgcag ctcaaattga
      301 ggacctgaga cgttccattg ctaacctgac tttgaaacaa cgagcaccta accctccagc
      361 aggaccgccc gccaaacgca agaagcctgc gccaagccta agcctggaga cgaaaaagaa
      421 gcgaccacca ccacctgcca agaaacaaaa acgtaaacct aaaccaggca aacgacagcg
      481 aatgtgtatg aagctagagt cagataaaac gtttccgatc atgttgaacg gacaggtgaa
      541 tggttacgcg tgcgtcgtgg gtggacgagt gtttaaaccg ctgcacgtag aaggcagaat
      601 agacaatgag caactggccg ctatcaagct gaagaaggcc agcatatatg accttgagta
      661 cggtgatgtg ccacaatgca tgaaatcaga taccctccag tacaccagtg acaagcctcc
      721 tggcttttat aactggcatc atggagctgt gcagtatgag aacaacaggt tcaccgtacc
      781 acgaggggtc ggtggaaagg gcgacagcgg gagacctatt cttgacaaca aaggtagagt
      841 cgtcgcaatt gtccagggtg gagtcaacga aggatccagg acggctctat cagtggtgac
      901 atggaaccaa aagggggtta cagtcaaaga tacaccagag gggtcagagc catggtcgct
      961 tgccactgtc atgtgcgtcc tggccaatat cacgtttcca tgtgatcaac caccctgcat
     1021 gccatgctgt tatgaaaaga atccacacga aacacttacc atgctggaac agaattacga
     1081 cagccgagcc tatgatcagc tgctcgatgc cgctgtgaaa tgtaatgcta ggagaaccag
     1141 gagagatttg gacactcatt tcacccagta taagttggca cgcccgtata ttgctgattg
     1201 ccctaactgt gggcatagtc ggtgcgacag ccctatagct atagaagaag tcagagggga
     1261 tgcgcatgca ggagtcatcc gcatccagac atcagctatg ttcggtctga agaggcatgg
     1321 agtcgatttg gcctacatga gtttcatgaa cggcaaaacg cagaaatcaa taaagatcga
     1381 caacctgcat gtgcgcacct cagccccttg ttccctcgtg tcgcaccacg gctattacat
     1441 cttggctcaa tgcccaccag gggacacggt tacagttggg tttcacgacg ggcctaaccg
     1501 ccatacgtgc agacttgccc ataaggtaga attcaggcca gtgggtagag agaaataccg
     1561 tcacccacct gaacatggag ttgaattacc gtgtaaccgt tacactcaca agcgtgcaga
     1621 ccaaggacac tatgttgaga tgcatcaacc agggctagtt ggcgaccact ctctccttag
     1681 catccacagt gccaaggtga aaattacggt accgagcggc gcccaagtga aatactactg
     1741 caagtgtcca gatgtacgag agggaattac cagcagcgac catacaacca cctgcacgga
     1801 tgtcaaacaa tgcagggctt acctgattga caacaaaaaa tgggtgtaca actctggaag
     1861 actgcctcga ggagagggcg acacttttaa aggaaaactt catgtgccct ttgtgcctgt
     1921 taaggccaag tgcatcgcca cgctggcacc ggagcctcta gttgagcaca aacaccgcac
     1981 cctgatttta cacctgcacc cggaccatcc gaccttgctg acgaccagat cacttggaag
     2041 tgatgcaaat ccaactcgac aatggattga gcgtccaaca actgtcaatt tcacagtcac
     2101 cggagaaggg ttggagtata cctggggaaa ccatccacca aaaagagtat gggctcaaga
     2161 gtcaggagaa gggaacccac atggatggcc gcacgttgtg gtagtctatt actacaacag
     2221 atacccgtta accacaatta tcgggttatg cacctgtgtg gctatcatca tggtctcttg
     2281 tgatcatccg tgtggctcct tttcaggact tcgcaatctt tgcataaccc cgtataaact
     2341 agccccgaac gctcaagtcc caatactcct ggcgttactt tgctgcatta agccgacgag
     2401 ggcagacgac accttgcaag tgctgaatta tctgtggaac aacaatcaaa actttttctg
     2461 gatgcagacg cttatcccac ttgcagcgct tatcgtatgc atgcgcatgc tcgctgcctt
     2521 attttgctgt gggccggctt ttttacttgt ctgcggcgct tgggccgcag cgtacgaaca
     2581 cacagcagtg atgccgaaca aggtggggat cccgtacaaa gctttagtcg aacgcccagg
     2641 ttatgcaccc gttcacctac agatacagct ggttaatacc aggataattc catcaactaa
     2701 cctggagtac atcacctgca agtacaagac aaaagtgccg tctccagtag tgaaatgctg
     2761 cggtgccact caatgtacct ccaaacccca tcctgactat cagtgtcagg tgtttacagg
     2821 tgtttaccca ttcatgtggg gaggagccta ctgcttctgc gacaccgaaa acacccagat
     2881 gagcgaggcg tatgtagagc gctcggaaga gtgctctatt gaccacgcaa aagcttataa
     2941 agtacacaca ggcactgttc aggcaatggt gaacataact tatgggagcg tcacgtggag
     3001 atctgcagat gtctacgtca atggtgaaac tcccgcgaaa ataggagatg ccaaactcat
     3061 cataggtcca ctgtcatctg cgtggtcccc attcgataac aaggtggtgg tttatgggca
     3121 tgaagtgtat aattacgact ttcctgagta cggcaccggc aaagcaggct cttttggaga
     3181 cctgcaatca cgcacatcaa ccagcaacga tctgtacgca aacaccaact tgaagctaca
     3241 acgaccccag gctggtatcg tgcacacacc tttcacccag gcgccctctg gcttcgaacg
     3301 atggaaaagg gacaaagggg caccgttgaa cgacgtagcc ccgtttggct gttcgattgc
     3361 cctggagccg ctccgtccag aaaattgtgc agtgggaagc atccctatat ctatagatat
     3421 acccgatgcg gctttcacca gaatatctga aacaccgaca gtctcagacc tggaatgcaa
     3481 aattacggag tgtacttatg cctccgattt cggtggtata gccacgttgc ctacaaatcc
     3541 agtaaagcag gaaactgtcc aattcattgt ccatcaggtg ttgcagttat taaagagaat
     3601 gacgtcaccc ttgctgagag cgggatcatt tacattccac ttctccactg caaacatcca
     3661 tcctgctttt aagctgcagg tctgcactag tggcattacc tgcaaaggag attgcaagcc
     3721 accgaaagat catatcgtcg attatccagc acaacatacc gaatccttta cgtcggcgat
     3781 atccgccacc gcgtggtcgt ggctaaaagt gctggtagga ggaacatcag catttattgt
     3841 tctggggctt attgctacag cagtggttgc cctagttctg ttcttccata gacattaaca
     3901 tcctgtcaac cacataacac tacaggcagt gtataaggct gtcttactaa acactaaatt
     3961 caccctagtt cgatgtactt ccgagctatg gtgacggtgg tgcataatgc cgcccgatgc
     4021 agtgcataag gctgctatat taccaaatta taacactaag ggcatgcata atgcttggtc
     4081 ctaagtaatt ttatacacac tttataatca ggcataattg ccgtatatac aattacacta
     4141 caggtaatat accgcctctt ataaatacta caggcgaagc gcataatgct gccttttata
     4201 tcaatttaca aaatcatatt aatttttctt ttatgttttt attttgtttt taatatttc
//