GenBank-Updates@genbank.bio.net (05/16/91)
LOCUS       PCSLTRA 8785 bp ss-RNA VRL 16-MAY-1991
DEFINITION  Simian sarcoma virus (SMRV-HLB; SMRV-H) complete genome.
ACCESSION   M23385
KEYWORDS    aspartyl protease; complete genome; env glycoprotein; gag protein;
            outer membrane protein TmpA; pol protein; transmembrane protein.
SOURCE      Simian sarcoma virus (type D; SMRV-HLB; SMRV-H) RNA, passed in
            human LMB cell line, clones pHVQ and pSMH.
  ORGANISM  Simian sarcoma virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
            Retroviridae; Oncovirinae; Type C oncovirus group;
            Mammalian type C oncoviruses.
REFERENCE   1 (bases 1 to 8785)
  AUTHORS   Oda,T., Ikeda,S., Watanabe,S., Hatsushika,M., Akiyama,K. and
            Mitsunobu,F.
  TITLE     Molecular cloning, complete nucleotide sequence, and gene structure
            of the provirus genome of a retrovirus produced in a human
            lymphoblastoid cell line
  JOURNAL   Virology 167, 468-476 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and clean copy of sequence kindly provided by
            T.Oda, 24-MAR-1989.
FEATURES             Location/Qualifiers
     misc_signal     332..337
                     /note="poly-A signal"
     CDS             6314..8041
                     /note="env membrane protein precursor"
                     /codon_start=6314
     mat_peptide     6314..7471
                     /note="env outer membrane protein"
                     /codon_start=6314
     mat_peptide     7472..8038
                     /note="env transmembrane protein"
                     /codon_start=7472
     LTR             8661..8666
                     /note="3' long terminal repeat"
     misc_signal     8661..8666
                     /note="poly-A signal"
BASE COUNT 2298 a 2620 c 1734 g 2133 t
ORIGIN
1 tgttgggaac ccaggctaag ctgatgctat tggaacaaaa ggttgcatca gccctccccg
61 ccaaaagcat gccccgggcg tggtggcggg ccaccaatgg aggacctgat cacgggcaag
121 acatgcctca gggccaccaa tggaggacct gatcacgggc aagacatgcc tcagggccac
181 caataaagga cctgatcacg cacagaacat gcatggctgc accaatgggg tagctgatca
241 tgagctaaac actcgtctcc cagcctatca gaactacttc ccctttcccc tctctacccg
301 ctcccctccc tatataagga acccattttg aaataaactt tgcagcttga tcagaacttt
361 tgtcttgctg ccattcttcg cgcctcttgt cccatccctt tctcatccac gacaggcttg
421 ctcgggttcc tgtttgttgc tctgcgggac agagcaagtg gcgcccagga cgtggggctc
481 gatgccggcc tccgtggacc gccgtcccct gtaaccggtt ccctgcacgg ccgtcccgat
541 taaccgattc cccgcacgga gcaccgcgga ccacccgacc gcgagccgac tcctggagtt
601 cgttcctcat ttcgacggcg gcattactca agtaagaccc aatcatggga caagcatctt
661 cacacagtga aaatgatctc tttataagtc acttaaagga atctctcaag gtgcgtagaa
721 ttcgggttcg caagaaagac cttgtctcct tttttagttt catttttaaa acatgtccat
781 ggttccccca ggaagggtct attgactccc gtgtttgggg acgtgttggt gattgtctga
841 acgactatta ccgtgttttt ggtcctgaga ctattccgat caccactttt aattattata
901 atttaataag ggacgtcctt actaatcaga gcgactcccc tgacattcaa cgcctctgca
961 aggagggtca caaaattctt attagccact cccgacctcc atctagacaa gcccctgtaa
1021 caattaccac ctctgaaaag gcctcctctc gccctccctc tagggcccct tctacctgcc
1081 cctcggttgc aattgacatt ggttcacatg atacagggca gtcttcactg taccccaacc
1141 ttgcaaccct tacggacccc cccattcaaa gtcctcattc tcgggcgcat actccgcccc
1201 aacatttgcc cttgcttgct aattctaaaa ccttgcataa ctcgggtagt caggatgatc
1261 aactaaatcc cgccgatcaa gcagatctgg aagaggcagc tgcccaatat aacaaccctg
1321 attggcccca attaactaac accccggcat tgccaccttt ccggccaccg tcctatgttt
1381 ctacagcagt gcccccagtg gcagttgcgg ctcctgtttt gcatgcccct acttcaggcg
1441 ttcctggttc ccccacggcc ccaaacttgc ccggtgtagc cctagccaaa ccctccggtc
1501 ccattgatga gactgtttct cttcttgatg gggttaaaac cttagtcaca aaactgtccg
1561 atttggccct tctacctccc gcgggagtta tggcttttcc cgttaccaga agtcagggac
1621 aggttagctc caataccacg ggccgagcgt ctcctcaccc tgacacacac accatccctg
1681 aggaggagga agcagactcc ggagaatctg actcagagga tgacgaggag gaaagctcag
1741 agcccaccga gcctacctac acccattcct ataagcggct aaatctaaag accatagaaa
1801 aaattaaaac tgctgttgct aactatggtc ctactgcccc ctttaccgtg gcccttgtag
1861 agagtcttag tgaaagatgg cttaccccta gtgattggtt tttcttgtct cgtgctgcgc
1921 tgagcggagg ggacaatatc ctttggaagt ctgagtatga ggatatttcc aaacagtttg
1981 cagagcgaac gcgcgtaagg cctcctccaa aggatggacc cttaaaaatt cctggcgcca
2041 gcccttatca gaacaatgac aaacaggccc aattcccccc agggctttta acccagattc
2101 agtccgcagg cctaaaagcc tggaagcgac tccctcaaaa gggagcggct actacttccc
2161 ttgcaaagat tagacaaggc cccgatgagt catacagtga ttttgtaagc cgcctccagg
2221 agacggcaga tcgccttttt ggctccgggg aaagtgagag ctcctttgta aaacacctag
2281 cctatgaaaa cgctaacccc gcttgccaaa gtgcaattcg gccttttagg cagaaggagc
2341 tttcgactat gtcgcctctg ctctggtatt gctctgccca tgctgttggc ctagccatag
2401 gagctgccct ccaaaatctt gcccccgcgc aactcctgga gcccaggccc gcctttgcta
2461 taattgtcac caacccggcc atctttcaag aaactgcccc caaaaaaata caaccaccta
2521 ctcaactccc aactcaacct aatgccccac aggctagcct tataaaaaat ttaggtccca
2581 caacaaaatg tcctcgctgc aaaaaaggat ttcactgggc ttcagaatgc cgttctcgat
2641 tagacattaa tggacaaccc attattaagc agggaaactt gaacaggggc cagccccagg
2701 gccccactac cgggatgaac tccggggctt cacagttcac cccccaatac cgccagccaa
2761 cccctgccct cccagtaatc aaccacgccg ctacgtcaca gacctctggc gagcaacagc
2821 gggcagtgca ggactggacc tctgtaccac caccgacaca atactaacca cccaaaatag
2881 ccctctgaca cttccagttg gaatatatgg acccttacca ccccagacat tcggcctcat
2941 attagcagag ccagctctac cctccaaggg gatccaagtt ctgcccggca tattagacaa
3001 tgattttgag ggagaaatcc atatcattct ctctacaact aaagatttag tcaccatccc
3061 aaagggcacc agactagctc aaatagtcat tctccccctc caacaaatta actccaattt
3121 ccataagccc taccgcgggg ctagtgcccc tgggtcttct gatgtctact gggttcaaca
3181 aatttctcaa cagcggccta ccctgaaact taaattaaat ggtaagctct tttctggcat
3241 tcttgataca ggggccgatg ccaccgttat atcttacact cactggccga ggaactggcc
3301 gttaacaacc gttgctactc acctgcgcgg tattggccag gccaccaacc cccaacaaag
3361 tgctcaaatg cttaagtggg aggactctga aggcaataat ggtcacatta ccccttatgt
3421 cctccccaat ctgccagtca atctctgggg aagggacatc ctctctcaaa tgaaacttgt
3481 catgtgcagt cccaacgata ctgtcatgac ccaaatgcta agccaggggt atctccccgg
3541 ccaagggttg ggaaaaaata atcaaggaat cacccagccc attactatta cccccaaaaa
3601 agacaaaaca ggcctaggat tccaccaaaa tttaccgtag tcgtgccatt gacattcctg
3661 taccccacgc tgacaaaatt tcctggaaaa ttacagaccc tgtgtgggtt gatcagtggc
3721 cacttacata tgagaaaacc ctcgctgcca ttgcgttagt acaggaacag ctcgcagcag
3781 gacatattga gcccacaaat tctccatgga atactcctat attcatcatt aagaaaaaat
3841 caggtagctg gcgtctttta caggatctaa gagccgttaa taaggtaatg gtccccatgg
3901 gagcccttca gcctggtctt ccctctcctg tagccatccc cctaaactat cacaaaattg
3961 ttattgacct taaggattgt ttctttacca tccccttaca ccctgaagac agaccttact
4021 ttgcctttag cgtccctcaa atcaacttcc aaagtcctat gcctcgttat cagtggaagg
4081 ttctgccaca gggcatggcc aacagtccca cactgtgcca aaaatttgtt gctgccgcca
4141 ttgccccagt aagatcccag tggccagagg cctatatcct ccattatatg gatgacatcc
4201 ttcttgcttg tgacagcgcc gaggcagcca aggcctgcta tgctcacatt atatcctgtc
4261 ttacctcata tggactaaaa attgctccag acaaggtaca agtgtctgag ccattttctt
4321 atttaggatt tgagttacac catcagcaag tatttactcc ccgagtctgc ttaaaaactg
4381 atcacttaaa aacccttaac gatttccaaa aattactcgg ggacattcag tggcttcgac
4441 cctatttaaa attgcccacc agtgcccttg ttccccttaa caatattcta aaaggcgatc
4501 caaatccttt atcggttcga gcactgaccc cagaggcaaa gcaatctcta gccctcatca
4561 acaaggctat ccaaaatcaa agtgttcaac aaatttcgta taaccttccc ctagtactcc
4621 tcttgctccc aactccccat acacccaccg cggtgttttg gcaaccaaac ggtacagacc
4681 ctacaaaaaa cggaagcccc ctcctttggc tccatctacc tgcctcccca tcaaaagtct
4741 tactcaccta cccctcgctc ctcgccatgt taattattaa gggtcggtac actggccgcc
4801 aactgtttgg cagggacccc cactctataa tcattccata cacccaggac caattaacct
4861 ggctcctgca aacctctgac gaatgggcca ttgcattatc ctccttcaca ggagacatag
4921 acaatcatta ccccagtgac cctgttatcc aatttgccaa gcttcaccag ttcatattcc
4981 ccaagatcac aaaatgtgcc ccaattcctc aagccacgct agttttcact gatggatcct
5041 caaacggaat tgctgcatat gttattgata atcaacccat ctcaataaaa tccccctacc
5101 tgtcagctca acttgttgag ctctatgcta ttctccaggt gttcacagtt ctagctcacc
5161 aaccgtttaa cttgtacact gacagtgcgt atattgctca atcagtccct cttttggaga
5221 cagtcccctt tatcaaatcc tcaaccaatg ctaccccctt attttctaaa ctgcaacagc
5281 taattttaaa cagacaacac cctttcttta tcggacatct tcgggcccac ctaaatcttc
5341 caggacccct ggctgaaggc aatgccttag ctgatgctgc cacacagatt ttccccatta
5401 taagtgaccc aatacatgag gctactcaag ctcacaccct acatcacctc aatgcacaca
5461 ccctacgatt actctataaa attactagag aacaagccag agatattgta aaagcttgca
5521 aacagtgtgt cgtagccacc cctgtacccc atcttggcgt gaacccccgt ggtttagtcc
5581 ccaatgccat ttggcaaatg gatgtcactc attttactcc ttttggaaaa cagaggtttg
5641 ttcatgttac tgttgacaca tttagtggtt ttatcttagc cactccccaa acaggtgaag
5701 catcaaaaaa tgttatatct catgttatcc actgtcttgc taccatagga aaaccacaca
5761 ccattaaaac agacaatggc ccgggatata ctggaaaaaa cttccaagac ttttgccaaa
5821 aactccaaat caaacatgtt actggtatac cgtacaaccc ccagggtcaa ggagtagttg
5881 aacgagctca tcaaacatta aaaaatgccc taaatcgctt agcccgctcc ccccttgggt
5941 tttctatgca acaacccaga aaccttctta gtcatgccct atttcaacta aattttctac
6001 agcttgacag tcaagggcgc tcggcagctg accgtctatg gcatccccaa acttctcagc
6061 agcatgctac ggttatgtgg cgtgaccctc tcaccagtgt ttggaagggc cctgaccctg
6121 tcctcatatg ggggcgaggc tcagcctgca tatacgatca aaaggaggat ggcccccgct
6181 ggctccctga gcgactaatt agacacatca ataatcagac agcccccttg tgtgacaggc
6241 caagtaaccc aaatacagcc ccagggccaa aaggctcgcc ctgaggagct ccttttctct
6301 tcttccagga agaatgctct gcatcctcat cctcctactg cacccacgcc tctgcccagt
6361 cacaaaggga ggacttggaa agccatccgg agacatttac actgccctct ttggagcgcc
6421 atgtgactgt aaagggggga ctcagaccaa taattacgcc accccaactt acactcaggt
6481 aacagattgt ggggacaaaa atgcctatct tacctatgac accaattgga atggagtatc
6541 ttcacctaag tggctttgtg tgcgcaagcc tcctagtata ccggtcatta atggccgccc
6601 aggcccgtgc ccaagcgagt gcacaaacaa cattaaatcc cagatgcact cctcctgcta
6661 ttctagtttc tcacagtgta ctcaaggcaa taatacttat tttactgcca ttctacaaag
6721 aacaaagagc acctcagaaa ccaatcctgt caccagcggc ctacaacctc atggggtcct
6781 ccaggccgga tgcgatggca cggttggaaa atcggtttgt tggaatcagc aagcccctat
6841 tcacgtctcc gacggtggcg gaccccaaga tgctgtgaga gagctttatg tacaaaaaca
6901 aatagagctt gttattcaaa gccaattccc taagttatcc taccaccccc tagctcgctc
6961 aaaaccaaga ggacctgaca ttgatgcaca aatgcttgat attctgtcag ccacccacca
7021 ggccctcaat atctccaacc ccagcctagc ccaaaattgc tggttatgct taaatcaagg
7081 tacctccatg cccctagcct tccctgtcaa tatatctagt tttaatgcct cacaaaataa
7141 ttgcaccccc agcttaccct ttagagtcca gcccatgcct tcccaagtat acccttgctt
7201 ctttaaaggt gcacaaaaca acagctttga tattccggtt ggcgttgcca actttgtaaa
7261 ctgctccagt agttccaacc acagtgaggc cctttgccct ggcccaggcc aagcttttgt
7321 ttgcggcaac aacctcgcct ttactgctct gcctgcaaac tggacagggt catgtgtgtt
7381 agccgccctc ctgccagata tagacattat ttctggtgat gaccctgtcc ctatccctac
7441 ctttgactat attgcagggc ggcagaaacg agccgttaca ctgattcccc tgctagtagg
7501 attgggtgtc tctacagcag tcgctaccgg tacagcagga ctcggggtgg ctgttcaatc
7561 ttacacaaaa ctttcccatc aacttattaa cgacgtccaa gccttgtcta gcaccattaa
7621 tgacttacag gaccaactag attccctagc cgaagtagtc ctccaaaaca gaagaggctt
7681 agacctactc actgcagaac agggaggtat ctgtttggct ctacaggaac gttgctgctt
7741 ttatgccaac aagtcaggaa ttgtccgaga taaaataaaa aatctacaag aagacctcga
7801 aaaaagacgc aaggcacttg cagacaatct cttcctcacc ggcctcaatg gacttctccc
7861 ttacctcctc cccttccttg gacccttatt cgctatcatc ctgttcttct cttttgcccc
7921 ttggatccta agacgagtaa cagcgttaat cagggatcag ctcaattccc tactgggaaa
7981 gcccatacaa atccactatc accaactagc aacgcgtgat ctagaatatg gcagactgta
8041 gccggttccc ctcctacggg agcagcatac cgctcgacac tatgctttac gaaggtaatg
8101 gacaccgcta ggtgcaaggc aaggcactgc aaggagaggc cttactaagg ctactgtcga
8161 gtctcctgag aggtaagctg gcttgcatag aggttggtac tcgaaaaatc ctctcctccc
8221 aaaaaggtac ctgtaagcct gaaaattaag gctcaggagg agcacagcct ctacctcccc
8281 tagctggtta aggtccgcct cctctttttt taaagaaaaa gggaggagat gttgggaacc
8341 caggctaagc tgatgctatt ggaacaaaag gttgcatcag ccctccccgc caaaagcatg
8401 ccccgggcgt ggtggcgggc caccaatgga ggacctgatc acgggcaaga catgcctcag
8461 ggccaccaat ggaggacctg atcacgggca agacatgcct cagggccacc aataaaggac
8521 ctgatcacgc acagaacatg catggctgca ccaatggggt agctgatcat gagctaaaca
8581 ctcgtctccc agcctatcag aactacttcc cctttcccct ctctacccgc tcccctccct
8641 atataaggaa cccattttga aataaacttt gcagcttgat cagaactttt gtcttgctgc
8701 cattcttcgc gcctcttgtc ccatcccttt ctcatccacg acaggcttgc tcgggttcct
8761 gtttgttgct ctgcgggaca gagca
//
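
The feature locations in the entry above are 1-based, inclusive coordinates into the ORIGIN block. As a minimal sketch of how they can be checked against the sequence, the Python below joins ORIGIN lines and slices out a simple `M..N` location. The helper names (`parse_origin`, `extract`, `base_count`) are hypothetical and not part of any GenBank tool, and only plain `M..N` locations are handled (no joins or complements). The sample lines are copied from the ORIGIN block of this entry.

```python
import re


def parse_origin(lines):
    """Join ORIGIN sequence lines into one lowercase string.

    Each line is expected to look like '301 ctcccctccc tatataagga ...':
    a 1-based start coordinate followed by blocks of bases.
    Returns (start, sequence), where start is the coordinate of the
    first base that was read.
    """
    start = None
    chunks = []
    for line in lines:
        fields = line.split()
        if not fields or not fields[0].isdigit():
            continue  # skip 'ORIGIN', '//' and blank lines
        if start is None:
            start = int(fields[0])
        chunks.append("".join(fields[1:]).lower())
    return start, "".join(chunks)


def extract(start, seq, location):
    """Return the bases for a simple 'M..N' location (1-based, inclusive)."""
    m = re.fullmatch(r"(\d+)\.\.(\d+)", location)
    if m is None:
        raise ValueError(f"unsupported location: {location!r}")
    first, last = int(m.group(1)), int(m.group(2))
    if first < start or last < first or last - start >= len(seq):
        raise ValueError(f"location {location} is outside the parsed sequence")
    return seq[first - start:last - start + 1]


def base_count(seq):
    """Tally a, c, g and t, for comparison with a BASE COUNT line."""
    return {base: seq.count(base) for base in "acgt"}


# Lines copied from the ORIGIN block of this entry.
ltr_lines = [
    "301 ctcccctccc tatataagga acccattttg aaataaactt tgcagcttga tcagaacttt",
    "361 tgtcttgctg ccattcttcg cgcctcttgt cccatccctt tctcatccac gacaggcttg",
]
env_lines = [
    "6301 tcttccagga agaatgctct gcatcctcat cctcctactg cacccacgcc tctgcccagt",
]

ltr_start, ltr_seq = parse_origin(ltr_lines)
env_start, env_seq = parse_origin(env_lines)

poly_a = extract(ltr_start, ltr_seq, "332..337")      # misc_signal 332..337
env_init = extract(env_start, env_seq, "6314..6316")  # first codon of the CDS

print(poly_a)    # aataaa
print(env_init)  # atg
```

Run over the full ORIGIN block, `base_count` gives totals that can be compared with the BASE COUNT line of the entry.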