GenBank-Updates@genbank.bio.net (04/16/91)
LOCUS PPH16 7904 bp ds-DNA Circular VRL 16-APR-1991
DEFINITION Human papillomavirus type 16 (HPV16), complete genome.
ACCESSION K02718
KEYWORDS circular; complete genome.
SOURCE Papilloma virus type 16 DNA, isolated from a human invasive
cervical carcinoma.
ORGANISM Human papillomavirus
Viridae; ds-DNA nonenveloped viruses; Papovaviridae;
Papillomavirus.
REFERENCE 1 (bases 1 to 7904)
AUTHORS Seedorf,K., Kraemmer,G., Duerst,M., Suhai,S. and Roewekamp,W.G.
TITLE Human papillomavirus type 16 DNA sequence
JOURNAL Virology 145, 181-185 (1985)
STANDARD full staff_review
REFERENCE 2 (sites)
AUTHORS Kennedy,I.M., Haddow,J.K. and Clements,J.B.
TITLE A negative element in the human poapillomavirus type 16 genome acts
at the level of late mRNA stability
JOURNAL J. Virol. 65, 2093-2097 (1991)
STANDARD full staff_review
COMMENT The sense strand of this double-stranded circular genome is shown,
with a numbering system matching the first 60 bp of HPVa1, HPV6b
and BPV1. The annotation of sites and features is solely based upon
homology comparison with these other papillomaviruses. In addition
to the coding sequences reported below, the authors note open
reading frames which do not start with 'ATG', but which are found
in other papillomaviruses. In particular, a second portion of the
E1 gene may be located out to base 2813 (the E1 protein is thought
to be generally involved in DNA replication).
A potential 'CAT'-box region is found beginning at base 7895 below,
and 'TATA' boxes for early and late transcripts may be located at
17, 65 and 4289. Potential polyadenylation signals are at bases
4213 and 7260.
HPV16, in comparison to HPV types 6 and 11, is more often
associated with malignant genital cancers in humans.
FEATURES Location/Qualifiers
CDS 83..559
/note="E6 (putative)"
/codon_start=83
CDS 562..858
/note="E7 (putative)"
/codon_start=562
CDS 865..1170
/note="E1 (putative)"
/codon_start=865
CDS 2755..3852
/note="E2 (putative)"
/codon_start=2755
CDS 4235..5656
/note="L2 (putative)"
/codon_start=4235
CDS 5559..7154
/note="L1 (putative)"
/codon_start=5559
BASE COUNT 2601 a 1377 c 1509 g 2417 t
ORIGIN Unreported.
1 actacaataa ttcatgtata aaactaaggg cgtaaccgaa atcggttgaa ccgaaaccgg
61 ttagtataaa agcagacatt ttatgcacca aaagagaact gcaatgtttc aggacccaca
121 ggagcgaccc agaaagttac cacagttatg cacagagctg caaacaacta tacatgatat
181 aatattagaa tgtgtgtact gcaagcaaca gttactgcga cgtgaggtat atgactttgc
241 ttttcgggat ttatgcatag tatatagaga tgggaatcca tatgctgtat gtgataaatg
301 tttaaagttt tattctaaaa ttagtgagta tagacattat tgttatagtt tgtatggaac
361 aacattagaa cagcaataca acaaaccgtt gtgtgatttg ttaattaggt gtattaactg
421 tcaaaagcca ctgtgtcctg aagaaaagca aagacatctg gacaaaaagc aaagattcca
481 taatataagg ggtcggtgga ccggtcgatg tatgtcttgt tgcagatcat caagaacacg
541 tagagaaacc cagctgtaat catgcatgga gatacaccta cattgcatga atatatgtta
601 gatttgcaac cagagacaac tgatctctac tgttatgagc aattaaatga cagctcagag
661 gaggaggatg aaatagatgg tccagctgga caagcagaac cggacagagc ccattacaat
721 attgtaacct tttgttgcaa gtgtgactct acgcttcggt tgtgcgtaca aagcacacac
781 gtagacattc gtactttgga agacctgtta atgggcacac taggaattgt gtgccccatc
841 tgttctcaga aaccataatc taccatggct gatcctgcag gtaccaatgg ggaagagggt
901 acgggatgta atggatggtt ttatgtagag gctgtagtgg aaaaaaaaac aggggatgct
961 atatcagatg acgagaacga aaatgacagt gatacaggtg aagatttggt agattttata
1021 gtaaatgata atgattattt aacacaggca gaaacagaga cagcacatgc gttgtttact
1081 gcacaggaag caaaacaaca tagagatgca gtacaggttc taaaacgaaa gtatttggta
1141 gtccacttag tgatattagt ggatgtgtag acaataatat tagtcctaga ttaaaagcta
1201 tatgtataga aaaacaaagt agagctgcaa aaaggagatt atttgaaagc gaagacagcg
1261 ggtatggcaa tactgaagtg gaaactcagc agatgttaca ggtagaaggg cgccatgaga
1321 ctgaaacacc atgtagtcag tatagtggtg gaagtggggg tggttgcagt cagtacagta
1381 gtggaagtgg gggagagggt gttagtgaaa gacacactat atgccaaaca ccacttacaa
1441 atattttaaa tgtactaaaa actagtaatg caaaggcagc aatgttagca aaatttaaag
1501 agttatacgg ggtgagtttt tcagaattag taagaccatt taaaagtaat aaatcaacgt
1561 gttgcgattg gtgtattgct gcatttggac ttacacccag tatagctgac agtataaaaa
1621 cactattaca acaatattgt ttatatttac acattcaaag tttagcatgt tcatggggaa
1681 tggttgtgtt actattagta agatataaat gtggaaaaaa tagagaaaca attgaaaaat
1741 tgctgtctaa actattatgt gtgtctccaa tgtgtatgat gatagagcct ccaaaattgc
1801 gtagtacagc agcagcatta tattggtata aaacaggtat atcaaatatt agtgaagtgt
1861 atggagacac gccagaatgg atacaaagac aaacagtatt acaacatagt tttaatgatt
1921 gtacatttga attatcacag atggtacaat gggcctacga taatgacata gtagacgata
1981 gtgaaattgc atataaatat gcacaattgg cagacactaa tagtaatgca agtgcctttc
2041 taaaaagtaa ttcacaggca aaaattgtaa aggattgtgc aacaatgtgt agacattata
2101 aacgagcaga aaaaaaacaa atgagtatga gtcaatggat aaaatataga tgtgataggg
2161 tagatgatgg aggtgattgg aagcaaattg ttatgttttt aaggtatcaa ggtgtagagt
2221 ttatgtcatt tttaactgca ttaaaaagat ttttgcaagg catacctaaa aaaaattgca
2281 tattactata tggtgcagct aacacaggta aatcattatt tggtatgagt ttaatgaaat
2341 ttctgcaagg gtctgtaata tgttttgtaa attctaaaag ccatttttgg ttacaaccat
2401 tagcagatgc caaaataggt atgttagatg atgctacagt gccctgttgg aactacatag
2461 atgacaattt aagaaatgca ttggatggaa atttagtttc tatggatgta aagcatagac
2521 cattggtaca actaaaatgc cctccattat taattacatc taacattaat gctggtacag
2581 attctaggtg gccttattta cataatagat tggtggtgtt tacatttcct aatgagtttc
2641 catttgacga aaacggaaat ccagtgtatg agcttaatga taagaactgg aaatcctttt
2701 tctcaaggac gtggtccaga ttaagtttgc acgaggacga ggacaaggaa aacgatggag
2761 actctttgcc aacgtttaaa tgtgtgtcag gacaaaatac taacacatta tgaaaatgat
2821 agtacagacc tacgtgacca tatagactat tggaaacaca tgcgcctaga atgtgctatt
2881 tattacaagg ccagagaaat gggatttaaa catattaacc accaagtggt gccaacactg
2941 gctgtatcaa agaataaagc attacaagca attgaactgc aactaacgtt agaaacaata
3001 tataactcac aatatagtaa tgaaaagtgg acattacaag acgttagcct tgaagtgtat
3061 ttaactgcac caacaggatg tataaaaaaa catggatata cagtggaagt gcagtttgat
3121 ggagacatat gcaatacaat gcattataca aactggacac atatatatat ttgtgaagaa
3181 gcatcagtaa ctgtggtaga gggtcaagtt gactattatg gtttatatta tgttcatgaa
3241 ggaatacgaa catattttgt gcagtttaaa gatgatgcag aaaaatatag taaaaataaa
3301 gtatgggaag ttcatgcggg tggtcaggta atattatgtc ctacatctgt gtttagcagc
3361 aacgaagtat cctctcctga aattattagg cagcacttgg ccaaccaccc cgccgcgacc
3421 cataccaaag ccgtcgcctt gggcaccgaa gaaacacaga cgactatcca gcgaccaaga
3481 tcagagccag acaccggaaa cccctgccac accactaagt tgttgcacag agactcagtg
3541 gacagtgctc caatcctcac tgcatttaac agctcacaca aaggacggat taactgtaat
3601 agtaacacta cacccatagt acatttaaaa ggtgatgcta atactttaaa atgtttaaga
3661 tatagattta aaaagcattg tacattgtat actgcagtgt cgtctacatg gcattggaca
3721 ggacataatg taaaacataa aagtgcaatt gttacactta catatgatag tgaatggcaa
3781 cgtgaccaat ttttgtctca agttaaaata ccaaaaacta ttacagtgtc tactggattt
3841 atgtctatat gacaaatctt gatactgcat ccacaacatt actggcgtgc tttttgcttt
3901 gctttgtgtg cttttgtgtg tctgcctatt aatacgtccg ctgcttttgt ctgtgtctac
3961 atacacatca ttaataatat tggtattact attgtggata acagcagcct ctgcgtttag
4021 gtgttttatt gtatatatta tatttgttta tataccatta tttttaatac atacacatgc
4081 acgcttttta attacataat gtatatgtac ataatgtaat tgttacatat aattgttgta
4141 taccataact tactattttt tcttttttat tttcatatat aatttttttt tttgtttgtt
4201 tgtttgtttt ttaataaact gttattactt aacaatgcga cacaaacgtt ctgcaaaacg
4261 cacaaaacgt gcatcggcta cccaacttta taaaacatgc aaacaggcag gtacatgtcc
4321 acctgacatt atacctaagg ttgaaggcaa aactattgct gaacaaatat tacaatatgg
4381 aagtatgggt gtattttttg gtgggttagg aattggaaca gggtcgggta caggcggacg
4441 cactgggtat attccattgg gaacaaggcc tcccacagct acagatacac ttgctcctgt
4501 aagaccccct ttaacagtag atcctgtggg cccttctgat ccttctatag tttctttagt
4561 ggaagaaact agttttattg atgctggtgc accaacatct gtaccttcca ttcccccaga
4621 tgtatcagga tttagtatta ctacttcaac tgataccaca cctgctatat tagatattaa
4681 taatactgtt actactgtta ctacacataa taatcccact ttcactgacc catctgtatt
4741 gcagcctcca acacctgcag aaactggagg gcattttaca ctttcatcat ccactattag
4801 tacacataat tatgaagaaa ttcctatgga tacatttatt gttagcacaa accctaacac
4861 agtaactagt agcacaccca taccagggtc tcgcccagtg gcacgcctag gattatatag
4921 tcgcacaaca caacaggtta aagttgtaga ccctgctttt gtaaccactc ccactaaact
4981 tattacatat gataatcctg catatgaagg tatagatgtg gataatacat tatatttttc
5041 tagtaatgat aatagtatta atatagctcc agatcctgac tttttggata tagttgcttt
5101 acataggcca gcattaacct ctaggcgtac tggcattagg tacagtagaa ttggtaataa
5161 acaaacacta cgtactcgta gtggaaaatc tataggtgct aaggtacatt attattatga
5221 tttaagtact attgatcctg cagaagaaat agaattacaa actataacac cttctacata
5281 tactaccact tcacatgcag cctcacctac ttctattaat aatggattat atgatattta
5341 tgcagatgac tttattacag atacttctac aaccccggta ccatctgtac cctctacatc
5401 tttatcaggt tatattcctg caaatacaac aattcctttt ggtggtgcat acaatattcc
5461 tttagtatca ggtcctgata tacccattaa tataactgac caagctcctt cattaattcc
5521 tatagttcca gggtctccac aatatacaat tattgctgat gcaggtgact tttatttaca
5581 tcctagttat tacatgttac gaaaacgacg taaacgttta ccatattttt tttcagatgt
5641 ctctttggct gcctagtgag gccactgtct acttgcctcc tgtcccagta tctaaggttg
5701 taagcacgga tgaatatgtt gcacgcacaa acatatatta tcatgcagga acatccagac
5761 tacttgcagt tggacatccc tattttccta ttaaaaaacc taacaataac aaaatattag
5821 ttcctaaagt atcaggatta caatacaggg tatttagaat acatttacct gaccccaata
5881 agtttggttt tcctgacacc tcattttata atccagatac acagcggctg gtttgggcct
5941 gtgtaggtgt tgaggtaggt cgtggtcagc cattaggtgt gggcattagt ggccatcctt
6001 tattaaataa attggatgac acagaaaatg ctagtgctta tgcagcaaat gcaggtgtgg
6061 ataatagaga atgtatatct atggattaca aacaaacaca attgtgttta attggttgca
6121 aaccacctat aggggaacac tggggcaaag gatccccatg taccaatgtt gcagtaaatc
6181 caggtgattg tccaccatta gagttaataa acacagttat tcaggatggt gatatggttc
6241 atactggctt tggtgctatg gactttacta cattacaggc taacaaaagt gaagttccac
6301 tggatatttg tacatctatt tgcaaatatc cagattatat taaaatggtg tcagaaccat
6361 atggcgacag cttatttttt tatttacgaa gggaacaaat gtttgttaga catttattta
6421 atagggctgg tactgttggt gaaaatgtac cagacgattt atacattaaa ggctctgggt
6481 ctactgcaaa tttagccagt tcaaattatt ttcctacacc tagtggttct atggttacct
6541 ctgatgccca aatattcaat aaaccttatt ggttacaacg agcacagggc cacaataatg
6601 gcatttgttg gggtaaccaa ctatttgtta ctgttgttga tactacacgc agtacaaata
6661 tgtcattatg tgctgccata tctacttcag aaactacata taaaaatact aactttaagg
6721 agtacctacg acatggggag gaatatgatt tacagtttat ttttcaactg tgcaaaataa
6781 ccttaactgc agacgttatg acatacatac attctatgaa ttccactatt ttggaggact
6841 ggaattttgg tctacaacct cccccaggag gcacactaga agatacttat aggtttgtaa
6901 cccaggcaat tgcttgtcaa aaacatacac ctccagcacc taaagaagat gatcccctta
6961 aaaaatacac tttttgggaa gtaaatttaa aggaaaagtt ttctgcagac ctagatcagt
7021 ttcctttagg acgcaaattt ttactacaag caggattgaa ggccaaacca aaatttacat
7081 taggaaaacg aaaagctaca cccaccacct catctacctc tacaactgct aaacgcaaaa
7141 aacgtaagct gtaagtattg tatgtatgtt gaattagtgt tgtttgttgt gtatatgttt
7201 gtatgtgctt gtatgtgctt gtaaatatta agttgtatgt gtgtttgtat gtatggtata
7261 ataaacacgt gtgtatgtgt ttttaaatgc ttgtgtaact attgtgtcat gcaacataaa
7321 taaacttatt gtttcaacac ctactaattg tgttgtggtt attcattgta tataaactat
7381 atttgctaca tcctgttttt gttttatata tactatattt tgtagcgcca ggcccatttt
7441 gtagcttcaa ccgaattcgg ttgcatgctt tttggcacaa aatgtgtttt tttaaatagt
7501 tctatgtcag caactatggt ttaaacttgt acgtttcctg cttgccatgc gtgccaaatc
7561 cctgttttcc tgacctgcac tgcttgccaa ccattccatt gttttttaca ctgcactatg
7621 tgcaactact gaatcactat gtacattgtg tcatataaaa taaatcacta tgcgccaacg
7681 ccttacatac cgctgttagg cacatatttt tggcttgttt taactaacct aattgcatat
7741 ttggcataag gtttaaactt ctaaggccaa ctaaatgtca ccctagttca tacatgaact
7801 gtgtaaaggt tagtcataca ttgttcattt gtaaaactgc acatgggtgt gtgcaaaccg
7861 attttgggtt acacatttac aagcaactta tataataata ctaa
//