GenBank-Updates@genbank.bio.net (05/29/91)
LOCUS HIVV2RODX 9671 bp ss-mRNA VRL 28-MAY-1991 DEFINITION Human immunodeficiency virus type 2 ROD isolate RNA genome (HIV-2) ACCESSION X05291 KEYWORDS acquired immune deficiency syndrome; art gene; env gene; f gene; gag gene; pol gene; q gene; r gene; tat gene. SOURCE Human immunodeficiency virus type 2 RNA. ORGANISM Human immunodeficiency virus type 2 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (sites) AUTHORS Clavel,F., Guyader,M., Guetard,D., Salle,M., Montagnier,L. and Alizon,M. TITLE Molecular cloning and polymorphism of the human immunodeficiency virus type 2 JOURNAL Nature 324, 691-695 (1986) STANDARD full staff_review REFERENCE 2 (bases 1 to 9671) AUTHORS Alison,M. JOURNAL Unpublished (1987) STANDARD full automatic REFERENCE 3 (bases 1 to 9671) AUTHORS Guyader,M., Emerman,M., Sonigo,P., Clavel,F., Montagnier,L. and Alizon,M. TITLE Genome organization and transactivation of the human immuno- deficiency virus type 2 JOURNAL Nature 326, 662-669 (1987) STANDARD full automatic COMMENT EPD; 16081; HIV-2(ROD). SWISS-PROT; P04577; ENV$HIV2R. SWISS-PROT; P04584; POL$HIV2R. SWISS-PROT; P04590; GAG$HIV2R. SWISS-PROT; P04595; VIF$HIV2R. SWISS-PROT; P04600; NEF$HIV2R. SWISS-PROT; P04605; TAT$HIV2R. SWISS-PROT; P04615; REV$HIV2R. SWISS-PROT; P06938; VPR$HIV2R. SWISS-PROT; P06939; VPX$HIV2R. From EMBL entry HIV2RODX; dated 18-DEC-1990. FEATURES Location/Qualifiers CDS join(5845..6140,8307..8400) /product="tat protein" /codon_start=5845 CDS join(6071..6140,8307..8536) /product="art protein" /codon_start=6071 misc_feature 1..9671 /note="HIV-2 RNA corresponding to integrated proviral DNA" repeat_region 1..299 /note="LTR" misc_feature 1..173 /note="R region" misc_feature 174..299 /note="U5 region" misc_feature 303..320 /note="primer binding site" CDS 546..2111 /product="gag protein" /codon_start=546 CDS 1829..4936 /product="pol protein" /codon_start=1829 misc_feature 4613..4626 /note="polypurine tract 2" CDS 4869..5513 /product="q protein" /codon_start=4869 CDS 5682..5996 /product="r protein" /codon_start=5682 CDS 6147..8720 /product="env protein" /codon_start=6147 CDS 8557..9324 /product="f protein" /codon_start=8557 misc_feature 8925..8939 /note="polypurine tract 1" repeat_region 8942..9671 /note="LTR" misc_feature 8942..9497 /note="U3 region" promoter 9329..9339 /note="core enhancer sequence" promoter 9401..9416 /note="core enhancer sequence" misc_feature 9420..9427 /note="pot. SP1 factor binding site" misc_feature 9428..9437 /note="pot. SP1 factor binding site" misc_feature 9438..9448 /note="pot. SP1 factor binding site" promoter 9465..9470 /note="TATA-box" misc_feature 9498..9671 /note="R region" misc_feature 9649..9654 /note="pot. polyA signal" polyA_site 9671..9671 /note="polyA site" BASE COUNT 3314 a 1973 c 2401 g 1983 t ORIGIN 1 ggtcgctctg cggagaggct ggcagattga gccctgggag gttctctcca gcactagcag 61 gtagagcctg ggtgttccct gctagactct caccagcact tggccggtgc tgggcagacg 121 gccccacgct tgcttgctta aaaacctctt aataaagctg ccagttagaa gcaagttaag 181 tgtgtgctcc catctctcct agtcgccgcc tggtcattcg gtgttcacct gagtaacaag 241 accctggtct gttaggaccc ttcttgcttt gggaaaccga ggcaggaaaa tccctagcag 301 gttggcgcct gaacagggac ttgaagaaga ctgagaagtc ttggaacacg gctgagtgaa 361 ggcagtaagg gcggcaggaa caaaccacga cggagtgctc ctagaaaggc gcgggccgag 421 gtaccaaagg cagcgtgtgg agcgggagga gaagaggcct ccgggtgaag gtaagtacct 481 acaccaaaaa ctgtagccga aagggcttgc tatcctacct ttagacaggt agaagattgt 541 gggagatggg cgcgagaaac tccgtcttga gagggaaaaa agcagatgaa ttagaaagaa 601 tcaggttacg gcccggcgga aagaaaaagt acaggctaaa acatattgtg tgggcagcga 661 ataaattgga cagattcgga ttagcagaga gcctgttgga gtcaaaagag ggttgtcaaa 721 aaattcttac agttttagat ccaatggtac cgacaggttc agaaaattta aaaagtcttt 781 ttaatactgt ctgcgtcatt tggtgcatac acgcagaaga gaaagtgaaa gatactgaag 841 gagcaaaaca aatagtgcgg agacatctag tggcagaaac aggaactgca gagaaaatgc 901 caagcacaag tagaccaaca gcaccatcta gcgagaaggg aggaaattac ccagtgcaac 961 atgtaggcgg caactacacc catataccgc tgagtccccg aaccctaaat gcctgggtaa 1021 aattagtaga ggaaaaaaag ttcggggcag aagtagtgcc aggatttcag gcactctcag 1081 aaggctgcac gccctatgat atcaaccaaa tgcttaattg tgtgggcgac catcaagcag 1141 ccatgcagat aatcagggag attatcaatg aggaagcagc agaatgggat gtgcaacatc 1201 caataccagg ccccttacca gcggggcagc ttagagagcc aaggggatct gacatagcag 1261 ggacaacaag cacagtagaa gaacagatcc agtggatgtt taggccacaa aatcctgtac 1321 cagtaggaaa catctataga agatggatcc agataggatt gcagaagtgt gtcaggatgt 1381 acaacccgac caacatccta gacataaaac agggaccaaa ggagccgttc caaagctatg 1441 tagatagatt ctacaaaagc ttgagggcag aacaaacaga tccagcagtg aagaattgga 1501 tgacccaaac actgctagta caaaatgcca acccagactg taaattagtg ctaaaaggac 1561 tagggatgaa ccctacctta gaagagatgc tgaccgcctg tcagggggta ggtgggccag 1621 gccagaaagc tagattaatg gcagaggccc tgaaagaggt cataggacct gcccctatcc 1681 cattcgcagc agcccagcag agaaaggcat ttaaatgctg gaactgtgga aaggaagggc 1741 actcggcaag acaatgccga gcacctagaa ggcagggctg ctggaagtgt ggtaagccag 1801 gacacatcat gacaaactgc ccagatagac aggcaggttt tttaggactg ggcccttggg 1861 gaaagaagcc ccgcaacttc cccgtggccc aagttccgca ggggctgaca ccaacagcac 1921 ccccagtgga tccagcagtg gatctactgg agaaatatat gcagcaaggg aaaagacaga 1981 gagagcagag agagagacca tacaaggaag tgacagagga cttactgcac ctcgagcagg 2041 gggagacacc atacagggag ccaccaacag aggacttgct gcacctcaat tctctctttg 2101 gaaaagacca gtagtcacag catacattga gggtcagcca gtagaagtct tgttagacac 2161 aggggctgac gactcaatag tagcaggaat agagttaggg aacaattata gcccaaaaat 2221 agtaggggga atagggggat tcataaatac caaggaatat aaaaatgtag aaatagaagt 2281 tctaaataaa aaggtacggg ccaccataat gacaggcgac accccaatca acatttttgg 2341 cagaaatatt ctgacagcct taggcatgtc attaaatcta ccagtcgcca aagtagagcc 2401 aataaaaata atgctaaagc cagggaaaga tggaccaaaa ctgagacaat ggcccttaac 2461 aaaagaaaaa atagaagcac taaaagaaat ctgtgaaaaa atggaaaaag aaggccagct 2521 agaggaagca cctccaacta atccttataa tacccccaca tttgcaatca agaaaaagga 2581 caaaaacaaa tggaggatgc taatagattt cagagaacta aacaaggtaa ctcaagattt 2641 cacagaaatt cagttaggaa ttccacaccc agcagggttg gccaagaaga gaagaattac 2701 tgtactagat gtaggggatg cttacttttc cataccacta catgaggact ttagaccata 2761 tactgcattt actctaccat cagtgaacaa tgcagaacca ggaaaaagat acatatataa 2821 agtcttgcca cagggatgga agggatcacc agcaattttt caacacacaa tgagacaggt 2881 attagaacca ttcagaaaag caaacaagga tgtcattatc attcagtaca tggatgatat 2941 cttaatagct agtgacagga cagatttaga acatgatagg gtagtcctgc agctcaagga 3001 acttctaaat ggcctaggat tttctacccc agatgagaag ttccaaaaag accctccata 3061 ccactggatg ggctatgaac tatggccaac taaatggaag ttgcagaaaa tacagttgcc 3121 ccaaaaagaa atatggacag tcaatgacat ccagaagcta gtgggtgtcc taaattgggc 3181 agcacaactc tacccaggga taaagaccaa acacttatgt aggttaatca gaggaaaaat 3241 gacactcaca gaagaagtac agtggacaga attagcagaa gcagagctag aagaaaacag 3301 aattatccta agccaggaac aagagggaca ctattaccaa gaagaaaaag agctagaagc 3361 aacagtccaa aaggatcaag agaatcagtg gacatataaa atacaccagg aagaaaaaat 3421 tctaaaagta ggaaaatatg caaaggtgaa aaacacccat accaatggaa tcagattgtt 3481 agcacaggta gttcagaaaa taggaaaaga agcactagtc atttggggac gaataccaaa 3541 atttcaccta ccagtagaga gagaaatctg ggagcagtgg tgggataact actggcaagt 3601 gacatggatc ccagactggg acttcgtgtc taccccacca ctggtcaggt tagcgtttaa 3661 cctggtaggg gatcctatac caggtgcaga gaccttctac acagatggat cctgcaatag 3721 gcaatcaaaa gaaggaaaag caggatatgt aacagataga gggaaagaca aggtaaagaa 3781 actagagcaa actaccaatc agcaagcaga actagaagcc tttgcgatgg cactaacaga 3841 ctcgggtcca aaagttaata ttatagtaga ctcacagtat gtaatgggga tcagtgcaag 3901 ccaaccaaca gagtcagaaa gtaaaatagt gaaccagatc atagaagaaa tgataaaaaa 3961 ggaagcaatc tatgttgcat gggtcccagc ccacaaaggc atagggggaa accaggaagt 4021 agatcattta gtgagtcagg gtatcagaca agtgttgttc ctggaaaaaa tagagcccgc 4081 tcaggaagaa catgaaaaat atcatagcaa tgtaaaagaa ctgtctcata aatttggaat 4141 acccaattta gtggcaaggc aaatagtaaa ctcatgtgcc caatgtcaac agaaagggga 4201 agctatacat gggcaagtaa atgcagaact aggcacttgg caaatggact gcacacattt 4261 agaaggaaag atcattatag tagcagtaca tgttgcaagt ggatttatag aagcagaagt 4321 catcccacag gaatcaggaa gacaaacagc actcttccta ttgaaactgg caagtaggtg 4381 gccaataaca cacttgcata cagataatgg tgccaacttc acttcacagg aggtgaagat 4441 ggtagcatgg tggataggta tagaacaatc ctttggagta ccttacaatc cacagagcca 4501 aggagtagta gaagcaatga atcaccatct aaaaaaccaa ataagtagaa tcagagaaca 4561 ggcaaataca atagaaacaa tagtactaat ggcaattcat tgcatgaatt ttaaaagaag 4621 ggggggaata ggggatatga ctccatcaga aagattaatc aatatgatca ccacagaaca 4681 agagatacaa ttcctccaag ccaaaaattc aaaattaaaa gattttcggg tctatttcag 4741 agaaggcaga gatcagttgt ggaaaggacc tggggaacta ctgtggaaag gagaaggagc 4801 agtcctagtc aaggtaggaa cagacataaa aataatacca agaaggaaag ccaagatcat 4861 cagagactat ggaggaagac aagagatgga tagtggttcc cacctggagg gtgccaggga 4921 ggatggagaa atggcatagc cttgtcaagt atctaaaata caaaacaaag gatctagaaa 4981 aggtgtgcta tgttccccac cataaggtgg gatgggcatg gtggacttgc agcagggtaa 5041 tattcccatt aaaaggaaac agtcatctag agatacaggc atattggaac ttaacaccag 5101 aaaaaggatg gctctcctct tattcagtaa gaataacttg gtacacagaa aagttctgga 5161 cagatgttac cccagactgt gcagatgtcc taatacatag cacttatttc ccttgcttta 5221 cagcaggtga agtaagaaga gccatcagag gggaaaagtt attgtcctgc tgcaattatc 5281 cccgagctca tagagcccag gtaccgtcac ttcaatttct ggccttagtg gtagtgcaac 5341 aaaatgacag accccagaga gacagtacca ccaggaaaca gcggcgaaga gactatcgga 5401 gaggccttcg cctggctaaa caggacagta gaagccataa acagagaagc agtgaatcac 5461 ctaccccgag aacttatttt ccaggtgtgg cagaggtcct ggagatactg gcatgatgaa 5521 caagggatgt cagaaagtta cacaaagtat agatatttgt gcataataca gaaagcagtg 5581 tacatgcatg ttaggaaagg gtgtacttgc ctggggaggg gacatgggcc aggagggtgg 5641 agaccagggc ctcctcctcc tccccctcca ggtctggtct aatggctgaa gcaccaacag 5701 agctcccccc ggtggatggg accccactga gggagccagg ggatgagtgg ataatagaaa 5761 tcttgagaga aataaaagaa gaagctttaa agcattttga ccctcgcttg ctaattgctc 5821 ttggcaaata tatctatact agacatggag acacccttga aggcgccaga gagctcatta 5881 aagtcctgca acgagccctt ttcacgcact tcagagcagg atgtggccac tcaagaattg 5941 gccagacaag gggaggaaat cctctctcag ctataccgac ccctagaaac atgcaataac 6001 tcatgctatt gtaagcgatg ctgctaccat tgtcagatgt gttttctaaa caaggggctc 6061 gggatatgtt atgaacgaaa gggcagacga agaaggactc caaagaaaac taagactcat 6121 ccgtctccta caccagacaa gtgagtatga tgaatcagct gcttattgcc attttattag 6181 ctagtgcttg cttagtatat tgcacccaat atgtaactgt tttctatggc gtacccacgt 6241 ggaaaaatgc aaccattccc ctcttttgtg caaccagaaa tagggatact tggggaacca 6301 tacagtgctt gcctgacaat gatgattatc aggaaataac tttgaatgta acagaggctt 6361 ttgatgcatg gaataataca gtaacagaac aagcaataga agatgtctgg catctattcg 6421 agacatcaat aaaaccatgt gtcaaactaa cacctttatg tgtagcaatg aaatgcagca 6481 gcacagagag cagcacaggg aacaacacaa cctcaaagag cacaagcaca accacaacca 6541 cacccacaga ccaggagcaa gagataagtg aggatactcc atgcgcacgc gcagacaact 6601 gctcaggatt gggagaggaa gaaacgatca attgccagtt caatatgaca ggattagaaa 6661 gagataagaa aaaacagtat aatgaaacat ggtactcaaa agatgtggtt tgtgagacaa 6721 ataatagcac aaatcagacc cagtgttaca tgaaccattg caacacatca gtcatcacag 6781 aatcatgtga caagcactat tgggatgcta taaggtttag atactgtgca ccaccgggtt 6841 atgccctatt aagatgtaat gataccaatt attcaggctt tgcacccaac tgttctaaag 6901 tagtagcttc tacatgcacc aggatgatgg aaacgcaaac ttccacatgg tttggcttta 6961 atggcactag agcagagaat agaacatata tctattggca tggcagagat aatagaacta 7021 tcatcagctt aaacaaatat tataatctca gtttgcattg taagaggcca gggaataaga 7081 cagtgaaaca aataatgctt atgtcaggac atgtgtttca ctcccactac cagccgatca 7141 ataaaagacc cagacaagca tggtgctggt tcaaaggcaa atggaaagac gccatgcagg 7201 aggtgaagga aacccttgca aaacatccca ggtatagagg aaccaatgac acaaggaata 7261 ttagctttgc agcgccagga aaaggctcag acccagaagt agcatacatg tggactaact 7321 gcagaggaga gtttctctac tgcaacatga cttggttcct caattggata gagaataaga 7381 cacaccgcaa ttatgcaccg tgccatataa agcaaataat taacacatgg cataaggtag 7441 ggagaaatgt atatttgcct cccagggaag gggagctgtc ctgcaactca acagtaacca 7501 gcataattgc taacattgac tggcaaaaca ataatcagac aaacattacc tttagtgcag 7561 aggtggcaga actatacaga ttggagttgg gagattataa attggtagaa ataacaccaa 7621 ttggcttcgc acctacaaaa gaaaaaagat actcctctgc tcacgggaga catacaagag 7681 gtgtgttcgt gctagggttc ttgggttttc tcgcaacagc aggttctgca atgggcgcgg 7741 cgtccctgac cgtgtcggct cagtcccgga ctttactggc cgggatagtg cagcaacagc 7801 aacagctgtt ggacgtggtc aagagacaac aagaactgtt gcgactgacc gtctggggaa 7861 cgaaaaacct ccaggcaaga gtcactgcta tagagaagta cctacaggac caggcgcggc 7921 taaattcatg gggatgtgcg tttagacaag tctgccacac tactgtacca tgggttaatg 7981 attccttagc acctgactgg gacaatatga cgtggcagga atgggaaaaa caagtccgct 8041 acctggaggc aaatatcagt aaaagtttag aacaggcaca aattcagcaa gagaaaaata 8101 tgtatgaact acaaaaatta aatagctggg atatttttgg caattggttt gacttaacct 8161 cctgggtcaa gtatattcaa tatggagtgc ttataatagt agcagtaata gctttaagaa 8221 tagtgatata tgtagtacaa atgttaagta ggcttagaaa gggctatagg cctgttttct 8281 cttccccccc cggttatatc caacagatcc atatccacaa ggaccgggga cagccagcca 8341 acgaagaaac agaagaagac ggtggaagca acggtggaga cagatactgg ccctggccga 8401 tagcatatat acatttcctg atccgccagc tgattcgcct cttgaccaga ctatacagca 8461 tctgcaggga cttactatcc aggagcttcc tgaccctcca actcatctac cagaatctca 8521 gagactggct gagacttaga acagccttct tgcaatatgg gtgcgagtgg atccaagaag 8581 cattccaggc cgccgcgagg gctacaagag agactcttgc gggcgcgtgc aggggcttgt 8641 ggagggtatt ggaacgaatc gggaggggaa tactcgcggt tccaagaagg atcagacagg 8701 gagcagaaat cgccctcctg tgagggacgg cagtatcagc agggagactt tatgaatact 8761 ccatggaagg acccagcagc agaaagggag aaaaatttgt acaggcaaca aaatatggat 8821 gatgtagatt cagatgatga tgaccaagta agagtttctg tcacaccaaa agtaccacta 8881 agaccaatga cacatagatt ggcaatagat atgtcacatt taataaaaac aaggggggga 8941 ctggaaggga tgttttacag tgaaagaaga cataaaatct taaatatata cttagaaaag 9001 gaagaaggga taattgcaga ttggcagaac tacactcatg ggccaggagt aagataccca 9061 atgttctttg ggtggctatg gaagctagta ccagtagatg tcccacaaga aggggaggac 9121 actgagactc actgcttagt acatccagca caaacaagca agtttgatga cccgcatggg 9181 gagacactag tctgggagtt tgatcccttg ctggcttata gttacgaggc ttttattcgg 9241 tacccagagg aatttgggca caagtcaggc ctgccagagg aagagtggaa ggcgagactg 9301 aaagcaagag gaataccatt tagttaaaga caggaacagc tatacttggt cagggcagga 9361 agtaactaac agaaacagct gagactgcag ggactttcca gaaggggctg taaccaaggg 9421 agggacatgg gaggagctgg tggggaacgc cctcatattc tctgtataaa tatacccgct 9481 agcttgcatt gtacttcggt cgctctgcgg agaggctggc agattgagcc ctgggaggtt 9541 ctctccagca gtagcaggta gagcctgggt gttccctgct agactctcac cagcacttgg 9601 ccggtgctgg gcagacggcc ccacgcttgc ttgcttaaaa acctccttaa taaagctgcc 9661 agttagaagc a //