GenBank-Updates@genbank.bio.net (05/29/91)
LOCUS HS4EBVMP 2640 bp ds-DNA VRL 28-MAY-1991
DEFINITION Epstein-Barr virus B95-8 (LMP) gene for membrane proteins
ACCESSION X01995
KEYWORDS membrane protein; overlapping genes; unidentified reading frame.
SOURCE Epstein-Barr virus DNA.
ORGANISM Epstein-Barr virus
Viridae; ds-DNA enveloped viruses; Herpesviridae;
Gammaherpesviridae.
REFERENCE 1 (bases 1 to 2640)
AUTHORS Hudson,G.S., Farrell,P.J. and Barrell,B.G.
TITLE Two related but differentially expressed potential membrane
proteins encoded by the EcoRI Dhet region of Epstein-Barr virus
B95-8
JOURNAL J. Virol. 53, 528-535 (1985)
STANDARD full automatic
COMMENT Data kindly reviewed (16-SEP-1985) by Farrell P.J. This sequence
corresponds to entry <EBV> bases 166930-169570 reversed
From EMBL entry HEEBVMP; dated 13-DEC-1990.
FEATURES Location/Qualifiers
CDS join(96..363,442..528,605..1408)
/product="42kd protein pot. membrane protein"
/codon_start=96
promoter 24..30
/note="ED-L1 promoter (42kd)"
misc_feature 55..58
/note="transcription initiation region (42kd)"
misc_feature 168..227
/note="pot. membrane-spanning or insertion segment 1"
misc_feature 249..311
/note="pot. membrane-spanning or insertion segment 2"
misc_feature 333..363
/note="pot. membrane-spanning or insertion segment 3, part
1"
intron 364..441
/note="intron I"
promoter 369..375
/note="ED-L1A promoter (28kd)"
misc_feature 401..402
/note="transcription initiation region (28kd)"
misc_feature 442..472
/note="pot. membrane-spanning or insertion segment 3, part
2"
misc_feature 486..528
/note="pot. membrane-spanning or insertion segment 4, part
1"
intron 529..604
/note="intron II"
misc_feature 605..630
/note="pot. membrane-spanning or insertion segment 4, part
2"
misc_feature 667..729
/note="pot. membrane-spanning or insertion segment 5"
misc_feature 748..807
/note="pot. membrane-spanning or insertion segment 6"
repeat_region 997..1029
/note="direct repeat 1"
repeat_region 1030..1062
/note="direct repeat 1"
repeat_region 1063..1110
/note="imp. direct repeat 1"
repeat_region 1111..1143
/note="direct repeat 1"
repeat_region 1144..1170
/note="imp. direct repeat 1"
promoter 2045..2051
/note="ED-L2 promoter (6.5kd)"
misc_feature 2075..2075
/note="put. cap site (6.5kd)"
misc_feature 2081..2081
/note="alternative cap site (6.5kd)"
CDS 2084..2263
/note="6.5kd unidentified reading frame (aa 1-60)"
/codon_start=2084
misc_feature 2210..2263
/note="pot. membrane-spanning or insertion segment"
misc_feature 2276..2278
/note="pot. start codon"
misc_feature 2620..2625
/note="polyadenylation signal"
misc_feature 2624..2629
/note="polyadenylation signal"
BASE COUNT 593 a 848 c 552 g 647 t
ORIGIN
1 gcgttactct gacgtagccg ccctacataa gcctctcaca ctgctctgcc cccttctttc
61 ctcaactgcc ttgctcctga cacactgccc tgaggatgga acacgacctt gagaggggcc
121 caccgggccc gcgacggccc cctcgaggac cccccctctc ctcttcccta ggccttgctc
181 tccttctcct cctcttggcg ctactgtttt ggctgtacat cgttatgagt gactggactg
241 gaggagccct ccttgtcctc tattcctttg ctctcatgct tataattata attttgatca
301 tctttatctt cagaagagac cttctctgtc cacttggagc cctttgtata ctcctactga
361 tgagtaagta ttacaccctt tgccccacac cccctttccc ttactcttcc ttctctaacg
421 cactttctcc tctttcccca gtcaccctcc tgctcatcgc tctctggaat ttgcacggac
481 aggcattgtt ccttggaatt gtgctgttca tcttcgggtg cttacttggt aagatctaac
541 attccctagg aattatttac cacaccccca cttttccaac cctaacactc ttttttcaac
601 gcagtcttag gtatctggat ctacttattg gagatgctct ggcgacttgg tgccaccatc
661 tggcagcttt tggccttctt cctagccttc ttcctagacc tcatcctgct cattattgct
721 ctctatctac aacaaaactg gtggactcta ttggttgatc tcctttggct cctcctgttt
781 ctggcgattt taatctggat gtattaccat ggacaacgac acagtgatga acaccaccac
841 gatgactccc tcccgcaccc tcaacaagct accgatgatt ctggccatga atctgactct
901 aactccaacg agggcagaca ccacctgctc gtgagtggag ccggcgacgg acccccactc
961 tgctctcaaa acctaggcgc acctggaggt ggtcctgaca atggcccaca ggaccctgac
1021 aacactgatg acaatggccc acaggaccct gacaacactg atgacaatgg cccacatgac
1081 ccgctgcctc aggaccctga caacactgat gacaatggcc cacaggaccc tgacaacact
1141 gatgacaatg gcccacatga cccgctgcct catagcccta gcgactctgc tggaaatgat
1201 ggaggccctc cacaattgac ggaagaggtt gaaaacaaag gaggtgacca gggcccgcct
1261 ttgatgacag acggaggcgg cggtcatagt catgattccg gccatggcgg cggtgatcca
1321 caccttccta cgctgctttt gggttcttct ggttccggtg gagatgatga cgacccccac
1381 ggcccagttc agctaagcta ctatgactaa cctttcttta cttctaggca ttaccatgtc
1441 ataggcttgc ctgactgact ctccctccat ttactgggaa tgccttagct aatcacctta
1501 actggcacac actcccttag ccacactgtc tgtctaggct gaaaagccac attcatattc
1561 tatttcaaaa caaggggaaa ggaggacatg cgagaattgg cagacacctt tacccagccc
1621 ttaacacacc acacaggtag caaggacccg ggcgttgcca gactccgcca ccaacgcccc
1681 tgcgttgaac ccacccctcc tacacacatc agacctctgc acaacacaac taccaggcag
1741 atgaggcccc ttacttccac agggtactgg cataccagcg ggggaccaca tacatccctg
1801 tctcccaccc agtaactcca gcaactttgc tttccatctt gtgccaatac acatttggat
1861 tcagcccaag ccacacctaa ctcatgccag cagaggcagg aacacctgtt gttgacacat
1921 tctttgcgca taagcacttt aatccctctc tcacacccag aaactaagag ctagcccaaa
1981 acctccacac ctgtcctcgc tcatctttcc acattcctct ggccttcttt ccttgtcctt
2041 actgtataaa agtccacgaa aacagctgtg cctcactctc gagatggtac acgtcctgga
2101 gcgtgctttg ctagagcagc agtcctctgc ctgcggcctg cccggctctt ctacggagac
2161 caggcctagc cacccctgcc ccgaggaccc agacgtcagc agactaagac tactcctggt
2221 ggtactctgt gtcctgtttg gacttttatg cctgctcctc atctaagaag ccaccatgcg
2281 accgggtaga ccactggctg gattctacgc tactctccgc cgttccttca gaagaatgtc
2341 caaaaggtca aagaacaagg ccaagaagga gcgtgtcccc gtggaggacc gcccaccgac
2401 tccgatgccc accagccagc gactgatccg cagaaacgcg ttgggaggag gcgtccgccc
2461 cgatgcggag gactgcatcc aacgcttcca ccccctggag ccagcgctgg gggtgtcaac
2521 aaagaacttt gacctgttgt ccctgagatg tgaattggga tggtgtggat aacatctccc
2581 gctagatggc gcccttatta ttgatgtgac ttgtgatgca ataaataaaa gtacagatag
//