GenBank-Updates@genbank.bio.net (04/16/91)
LOCUS PPH16 7904 bp ds-DNA Circular VRL 16-APR-1991 DEFINITION Human papillomavirus type 16 (HPV16), complete genome. ACCESSION K02718 KEYWORDS circular; complete genome. SOURCE Papilloma virus type 16 DNA, isolated from a human invasive cervical carcinoma. ORGANISM Human papillomavirus Viridae; ds-DNA nonenveloped viruses; Papovaviridae; Papillomavirus. REFERENCE 1 (bases 1 to 7904) AUTHORS Seedorf,K., Kraemmer,G., Duerst,M., Suhai,S. and Roewekamp,W.G. TITLE Human papillomavirus type 16 DNA sequence JOURNAL Virology 145, 181-185 (1985) STANDARD full staff_review REFERENCE 2 (sites) AUTHORS Kennedy,I.M., Haddow,J.K. and Clements,J.B. TITLE A negative element in the human poapillomavirus type 16 genome acts at the level of late mRNA stability JOURNAL J. Virol. 65, 2093-2097 (1991) STANDARD full staff_review COMMENT The sense strand of this double-stranded circular genome is shown, with a numbering system matching the first 60 bp of HPVa1, HPV6b and BPV1. The annotation of sites and features is solely based upon homology comparison with these other papillomaviruses. In addition to the coding sequences reported below, the authors note open reading frames which do not start with 'ATG', but which are found in other papillomaviruses. In particular, a second portion of the E1 gene may be located out to base 2813 (the E1 protein is thought to be generally involved in DNA replication). A potential 'CAT'-box region is found beginning at base 7895 below, and 'TATA' boxes for early and late transcripts may be located at 17, 65 and 4289. Potential polyadenylation signals are at bases 4213 and 7260. HPV16, in comparison to HPV types 6 and 11, is more often associated with malignant genital cancers in humans. FEATURES Location/Qualifiers CDS 83..559 /note="E6 (putative)" /codon_start=83 CDS 562..858 /note="E7 (putative)" /codon_start=562 CDS 865..1170 /note="E1 (putative)" /codon_start=865 CDS 2755..3852 /note="E2 (putative)" /codon_start=2755 CDS 4235..5656 /note="L2 (putative)" /codon_start=4235 CDS 5559..7154 /note="L1 (putative)" /codon_start=5559 BASE COUNT 2601 a 1377 c 1509 g 2417 t ORIGIN Unreported. 1 actacaataa ttcatgtata aaactaaggg cgtaaccgaa atcggttgaa ccgaaaccgg 61 ttagtataaa agcagacatt ttatgcacca aaagagaact gcaatgtttc aggacccaca 121 ggagcgaccc agaaagttac cacagttatg cacagagctg caaacaacta tacatgatat 181 aatattagaa tgtgtgtact gcaagcaaca gttactgcga cgtgaggtat atgactttgc 241 ttttcgggat ttatgcatag tatatagaga tgggaatcca tatgctgtat gtgataaatg 301 tttaaagttt tattctaaaa ttagtgagta tagacattat tgttatagtt tgtatggaac 361 aacattagaa cagcaataca acaaaccgtt gtgtgatttg ttaattaggt gtattaactg 421 tcaaaagcca ctgtgtcctg aagaaaagca aagacatctg gacaaaaagc aaagattcca 481 taatataagg ggtcggtgga ccggtcgatg tatgtcttgt tgcagatcat caagaacacg 541 tagagaaacc cagctgtaat catgcatgga gatacaccta cattgcatga atatatgtta 601 gatttgcaac cagagacaac tgatctctac tgttatgagc aattaaatga cagctcagag 661 gaggaggatg aaatagatgg tccagctgga caagcagaac cggacagagc ccattacaat 721 attgtaacct tttgttgcaa gtgtgactct acgcttcggt tgtgcgtaca aagcacacac 781 gtagacattc gtactttgga agacctgtta atgggcacac taggaattgt gtgccccatc 841 tgttctcaga aaccataatc taccatggct gatcctgcag gtaccaatgg ggaagagggt 901 acgggatgta atggatggtt ttatgtagag gctgtagtgg aaaaaaaaac aggggatgct 961 atatcagatg acgagaacga aaatgacagt gatacaggtg aagatttggt agattttata 1021 gtaaatgata atgattattt aacacaggca gaaacagaga cagcacatgc gttgtttact 1081 gcacaggaag caaaacaaca tagagatgca gtacaggttc taaaacgaaa gtatttggta 1141 gtccacttag tgatattagt ggatgtgtag acaataatat tagtcctaga ttaaaagcta 1201 tatgtataga aaaacaaagt agagctgcaa aaaggagatt atttgaaagc gaagacagcg 1261 ggtatggcaa tactgaagtg gaaactcagc agatgttaca ggtagaaggg cgccatgaga 1321 ctgaaacacc atgtagtcag tatagtggtg gaagtggggg tggttgcagt cagtacagta 1381 gtggaagtgg gggagagggt gttagtgaaa gacacactat atgccaaaca ccacttacaa 1441 atattttaaa tgtactaaaa actagtaatg caaaggcagc aatgttagca aaatttaaag 1501 agttatacgg ggtgagtttt tcagaattag taagaccatt taaaagtaat aaatcaacgt 1561 gttgcgattg gtgtattgct gcatttggac ttacacccag tatagctgac agtataaaaa 1621 cactattaca acaatattgt ttatatttac acattcaaag tttagcatgt tcatggggaa 1681 tggttgtgtt actattagta agatataaat gtggaaaaaa tagagaaaca attgaaaaat 1741 tgctgtctaa actattatgt gtgtctccaa tgtgtatgat gatagagcct ccaaaattgc 1801 gtagtacagc agcagcatta tattggtata aaacaggtat atcaaatatt agtgaagtgt 1861 atggagacac gccagaatgg atacaaagac aaacagtatt acaacatagt tttaatgatt 1921 gtacatttga attatcacag atggtacaat gggcctacga taatgacata gtagacgata 1981 gtgaaattgc atataaatat gcacaattgg cagacactaa tagtaatgca agtgcctttc 2041 taaaaagtaa ttcacaggca aaaattgtaa aggattgtgc aacaatgtgt agacattata 2101 aacgagcaga aaaaaaacaa atgagtatga gtcaatggat aaaatataga tgtgataggg 2161 tagatgatgg aggtgattgg aagcaaattg ttatgttttt aaggtatcaa ggtgtagagt 2221 ttatgtcatt tttaactgca ttaaaaagat ttttgcaagg catacctaaa aaaaattgca 2281 tattactata tggtgcagct aacacaggta aatcattatt tggtatgagt ttaatgaaat 2341 ttctgcaagg gtctgtaata tgttttgtaa attctaaaag ccatttttgg ttacaaccat 2401 tagcagatgc caaaataggt atgttagatg atgctacagt gccctgttgg aactacatag 2461 atgacaattt aagaaatgca ttggatggaa atttagtttc tatggatgta aagcatagac 2521 cattggtaca actaaaatgc cctccattat taattacatc taacattaat gctggtacag 2581 attctaggtg gccttattta cataatagat tggtggtgtt tacatttcct aatgagtttc 2641 catttgacga aaacggaaat ccagtgtatg agcttaatga taagaactgg aaatcctttt 2701 tctcaaggac gtggtccaga ttaagtttgc acgaggacga ggacaaggaa aacgatggag 2761 actctttgcc aacgtttaaa tgtgtgtcag gacaaaatac taacacatta tgaaaatgat 2821 agtacagacc tacgtgacca tatagactat tggaaacaca tgcgcctaga atgtgctatt 2881 tattacaagg ccagagaaat gggatttaaa catattaacc accaagtggt gccaacactg 2941 gctgtatcaa agaataaagc attacaagca attgaactgc aactaacgtt agaaacaata 3001 tataactcac aatatagtaa tgaaaagtgg acattacaag acgttagcct tgaagtgtat 3061 ttaactgcac caacaggatg tataaaaaaa catggatata cagtggaagt gcagtttgat 3121 ggagacatat gcaatacaat gcattataca aactggacac atatatatat ttgtgaagaa 3181 gcatcagtaa ctgtggtaga gggtcaagtt gactattatg gtttatatta tgttcatgaa 3241 ggaatacgaa catattttgt gcagtttaaa gatgatgcag aaaaatatag taaaaataaa 3301 gtatgggaag ttcatgcggg tggtcaggta atattatgtc ctacatctgt gtttagcagc 3361 aacgaagtat cctctcctga aattattagg cagcacttgg ccaaccaccc cgccgcgacc 3421 cataccaaag ccgtcgcctt gggcaccgaa gaaacacaga cgactatcca gcgaccaaga 3481 tcagagccag acaccggaaa cccctgccac accactaagt tgttgcacag agactcagtg 3541 gacagtgctc caatcctcac tgcatttaac agctcacaca aaggacggat taactgtaat 3601 agtaacacta cacccatagt acatttaaaa ggtgatgcta atactttaaa atgtttaaga 3661 tatagattta aaaagcattg tacattgtat actgcagtgt cgtctacatg gcattggaca 3721 ggacataatg taaaacataa aagtgcaatt gttacactta catatgatag tgaatggcaa 3781 cgtgaccaat ttttgtctca agttaaaata ccaaaaacta ttacagtgtc tactggattt 3841 atgtctatat gacaaatctt gatactgcat ccacaacatt actggcgtgc tttttgcttt 3901 gctttgtgtg cttttgtgtg tctgcctatt aatacgtccg ctgcttttgt ctgtgtctac 3961 atacacatca ttaataatat tggtattact attgtggata acagcagcct ctgcgtttag 4021 gtgttttatt gtatatatta tatttgttta tataccatta tttttaatac atacacatgc 4081 acgcttttta attacataat gtatatgtac ataatgtaat tgttacatat aattgttgta 4141 taccataact tactattttt tcttttttat tttcatatat aatttttttt tttgtttgtt 4201 tgtttgtttt ttaataaact gttattactt aacaatgcga cacaaacgtt ctgcaaaacg 4261 cacaaaacgt gcatcggcta cccaacttta taaaacatgc aaacaggcag gtacatgtcc 4321 acctgacatt atacctaagg ttgaaggcaa aactattgct gaacaaatat tacaatatgg 4381 aagtatgggt gtattttttg gtgggttagg aattggaaca gggtcgggta caggcggacg 4441 cactgggtat attccattgg gaacaaggcc tcccacagct acagatacac ttgctcctgt 4501 aagaccccct ttaacagtag atcctgtggg cccttctgat ccttctatag tttctttagt 4561 ggaagaaact agttttattg atgctggtgc accaacatct gtaccttcca ttcccccaga 4621 tgtatcagga tttagtatta ctacttcaac tgataccaca cctgctatat tagatattaa 4681 taatactgtt actactgtta ctacacataa taatcccact ttcactgacc catctgtatt 4741 gcagcctcca acacctgcag aaactggagg gcattttaca ctttcatcat ccactattag 4801 tacacataat tatgaagaaa ttcctatgga tacatttatt gttagcacaa accctaacac 4861 agtaactagt agcacaccca taccagggtc tcgcccagtg gcacgcctag gattatatag 4921 tcgcacaaca caacaggtta aagttgtaga ccctgctttt gtaaccactc ccactaaact 4981 tattacatat gataatcctg catatgaagg tatagatgtg gataatacat tatatttttc 5041 tagtaatgat aatagtatta atatagctcc agatcctgac tttttggata tagttgcttt 5101 acataggcca gcattaacct ctaggcgtac tggcattagg tacagtagaa ttggtaataa 5161 acaaacacta cgtactcgta gtggaaaatc tataggtgct aaggtacatt attattatga 5221 tttaagtact attgatcctg cagaagaaat agaattacaa actataacac cttctacata 5281 tactaccact tcacatgcag cctcacctac ttctattaat aatggattat atgatattta 5341 tgcagatgac tttattacag atacttctac aaccccggta ccatctgtac cctctacatc 5401 tttatcaggt tatattcctg caaatacaac aattcctttt ggtggtgcat acaatattcc 5461 tttagtatca ggtcctgata tacccattaa tataactgac caagctcctt cattaattcc 5521 tatagttcca gggtctccac aatatacaat tattgctgat gcaggtgact tttatttaca 5581 tcctagttat tacatgttac gaaaacgacg taaacgttta ccatattttt tttcagatgt 5641 ctctttggct gcctagtgag gccactgtct acttgcctcc tgtcccagta tctaaggttg 5701 taagcacgga tgaatatgtt gcacgcacaa acatatatta tcatgcagga acatccagac 5761 tacttgcagt tggacatccc tattttccta ttaaaaaacc taacaataac aaaatattag 5821 ttcctaaagt atcaggatta caatacaggg tatttagaat acatttacct gaccccaata 5881 agtttggttt tcctgacacc tcattttata atccagatac acagcggctg gtttgggcct 5941 gtgtaggtgt tgaggtaggt cgtggtcagc cattaggtgt gggcattagt ggccatcctt 6001 tattaaataa attggatgac acagaaaatg ctagtgctta tgcagcaaat gcaggtgtgg 6061 ataatagaga atgtatatct atggattaca aacaaacaca attgtgttta attggttgca 6121 aaccacctat aggggaacac tggggcaaag gatccccatg taccaatgtt gcagtaaatc 6181 caggtgattg tccaccatta gagttaataa acacagttat tcaggatggt gatatggttc 6241 atactggctt tggtgctatg gactttacta cattacaggc taacaaaagt gaagttccac 6301 tggatatttg tacatctatt tgcaaatatc cagattatat taaaatggtg tcagaaccat 6361 atggcgacag cttatttttt tatttacgaa gggaacaaat gtttgttaga catttattta 6421 atagggctgg tactgttggt gaaaatgtac cagacgattt atacattaaa ggctctgggt 6481 ctactgcaaa tttagccagt tcaaattatt ttcctacacc tagtggttct atggttacct 6541 ctgatgccca aatattcaat aaaccttatt ggttacaacg agcacagggc cacaataatg 6601 gcatttgttg gggtaaccaa ctatttgtta ctgttgttga tactacacgc agtacaaata 6661 tgtcattatg tgctgccata tctacttcag aaactacata taaaaatact aactttaagg 6721 agtacctacg acatggggag gaatatgatt tacagtttat ttttcaactg tgcaaaataa 6781 ccttaactgc agacgttatg acatacatac attctatgaa ttccactatt ttggaggact 6841 ggaattttgg tctacaacct cccccaggag gcacactaga agatacttat aggtttgtaa 6901 cccaggcaat tgcttgtcaa aaacatacac ctccagcacc taaagaagat gatcccctta 6961 aaaaatacac tttttgggaa gtaaatttaa aggaaaagtt ttctgcagac ctagatcagt 7021 ttcctttagg acgcaaattt ttactacaag caggattgaa ggccaaacca aaatttacat 7081 taggaaaacg aaaagctaca cccaccacct catctacctc tacaactgct aaacgcaaaa 7141 aacgtaagct gtaagtattg tatgtatgtt gaattagtgt tgtttgttgt gtatatgttt 7201 gtatgtgctt gtatgtgctt gtaaatatta agttgtatgt gtgtttgtat gtatggtata 7261 ataaacacgt gtgtatgtgt ttttaaatgc ttgtgtaact attgtgtcat gcaacataaa 7321 taaacttatt gtttcaacac ctactaattg tgttgtggtt attcattgta tataaactat 7381 atttgctaca tcctgttttt gttttatata tactatattt tgtagcgcca ggcccatttt 7441 gtagcttcaa ccgaattcgg ttgcatgctt tttggcacaa aatgtgtttt tttaaatagt 7501 tctatgtcag caactatggt ttaaacttgt acgtttcctg cttgccatgc gtgccaaatc 7561 cctgttttcc tgacctgcac tgcttgccaa ccattccatt gttttttaca ctgcactatg 7621 tgcaactact gaatcactat gtacattgtg tcatataaaa taaatcacta tgcgccaacg 7681 ccttacatac cgctgttagg cacatatttt tggcttgttt taactaacct aattgcatat 7741 ttggcataag gtttaaactt ctaaggccaa ctaaatgtca ccctagttca tacatgaact 7801 gtgtaaaggt tagtcataca ttgttcattt gtaaaactgc acatgggtgt gtgcaaaccg 7861 attttgggtt acacatttac aagcaactta tataataata ctaa //