[bionet.molbio.genbank.updates] Human papillomavirus type 16

GenBank-Updates@genbank.bio.net (04/16/91)

LOCUS       PPH16        7904 bp ds-DNA   Circular  VRL       16-APR-1991
DEFINITION  Human papillomavirus type 16 (HPV16), complete genome.
ACCESSION   K02718
KEYWORDS    circular; complete genome.
SOURCE      Papilloma virus type 16 DNA, isolated from a human invasive
            cervical carcinoma.
  ORGANISM  Human papillomavirus
            Viridae; ds-DNA nonenveloped viruses; Papovaviridae;
            Papillomavirus.
REFERENCE   1  (bases 1 to 7904)
  AUTHORS   Seedorf,K., Kraemmer,G., Duerst,M., Suhai,S. and Roewekamp,W.G.
  TITLE     Human papillomavirus type 16 DNA sequence
  JOURNAL   Virology 145, 181-185 (1985)
  STANDARD  full staff_review
REFERENCE   2  (sites)
  AUTHORS   Kennedy,I.M., Haddow,J.K. and Clements,J.B.
  TITLE     A negative element in the human papillomavirus type 16 genome acts
            at the level of late mRNA stability
  JOURNAL   J. Virol. 65, 2093-2097 (1991)
  STANDARD  full staff_review
COMMENT     The sense strand of this double-stranded circular genome is shown,
            with a numbering system matching the first 60 bp of HPVa1, HPV6b
            and BPV1. The annotation of sites and features is solely based upon
            homology comparison with these other papillomaviruses. In addition
            to the coding sequences reported below, the authors note open
            reading frames which do not start with 'ATG', but which are found
            in other papillomaviruses. In particular, a second portion of the
            E1 gene may be located out to base 2813 (the E1 protein is thought
            to be generally involved in DNA replication).
            
            A potential 'CAT'-box region is found beginning at base 7895 below,
            and 'TATA' boxes for early and late transcripts may be located at
            17, 65 and 4289. Potential polyadenylation signals are at bases
            4213 and 7260.
            
            HPV16, in comparison to HPV types 6 and 11, is more often
            associated with malignant genital cancers in humans.
FEATURES             Location/Qualifiers
     CDS             83..559
                     /note="E6 (putative)"
                     /codon_start=83
     CDS             562..858
                     /note="E7 (putative)"
                     /codon_start=562
     CDS             865..1170
                     /note="E1 (putative)"
                     /codon_start=865
     CDS             2755..3852
                     /note="E2 (putative)"
                     /codon_start=2755
     CDS             4235..5656
                     /note="L2 (putative)"
                     /codon_start=4235
     CDS             5559..7154
                     /note="L1 (putative)"
                     /codon_start=5559
BASE COUNT     2601 a   1377 c   1509 g   2417 t
ORIGIN      Unreported.
        1 actacaataa ttcatgtata aaactaaggg cgtaaccgaa atcggttgaa ccgaaaccgg
       61 ttagtataaa agcagacatt ttatgcacca aaagagaact gcaatgtttc aggacccaca
      121 ggagcgaccc agaaagttac cacagttatg cacagagctg caaacaacta tacatgatat
      181 aatattagaa tgtgtgtact gcaagcaaca gttactgcga cgtgaggtat atgactttgc
      241 ttttcgggat ttatgcatag tatatagaga tgggaatcca tatgctgtat gtgataaatg
      301 tttaaagttt tattctaaaa ttagtgagta tagacattat tgttatagtt tgtatggaac
      361 aacattagaa cagcaataca acaaaccgtt gtgtgatttg ttaattaggt gtattaactg
      421 tcaaaagcca ctgtgtcctg aagaaaagca aagacatctg gacaaaaagc aaagattcca
      481 taatataagg ggtcggtgga ccggtcgatg tatgtcttgt tgcagatcat caagaacacg
      541 tagagaaacc cagctgtaat catgcatgga gatacaccta cattgcatga atatatgtta
      601 gatttgcaac cagagacaac tgatctctac tgttatgagc aattaaatga cagctcagag
      661 gaggaggatg aaatagatgg tccagctgga caagcagaac cggacagagc ccattacaat
      721 attgtaacct tttgttgcaa gtgtgactct acgcttcggt tgtgcgtaca aagcacacac
      781 gtagacattc gtactttgga agacctgtta atgggcacac taggaattgt gtgccccatc
      841 tgttctcaga aaccataatc taccatggct gatcctgcag gtaccaatgg ggaagagggt
      901 acgggatgta atggatggtt ttatgtagag gctgtagtgg aaaaaaaaac aggggatgct
      961 atatcagatg acgagaacga aaatgacagt gatacaggtg aagatttggt agattttata
     1021 gtaaatgata atgattattt aacacaggca gaaacagaga cagcacatgc gttgtttact
     1081 gcacaggaag caaaacaaca tagagatgca gtacaggttc taaaacgaaa gtatttggta
     1141 gtccacttag tgatattagt ggatgtgtag acaataatat tagtcctaga ttaaaagcta
     1201 tatgtataga aaaacaaagt agagctgcaa aaaggagatt atttgaaagc gaagacagcg
     1261 ggtatggcaa tactgaagtg gaaactcagc agatgttaca ggtagaaggg cgccatgaga
     1321 ctgaaacacc atgtagtcag tatagtggtg gaagtggggg tggttgcagt cagtacagta
     1381 gtggaagtgg gggagagggt gttagtgaaa gacacactat atgccaaaca ccacttacaa
     1441 atattttaaa tgtactaaaa actagtaatg caaaggcagc aatgttagca aaatttaaag
     1501 agttatacgg ggtgagtttt tcagaattag taagaccatt taaaagtaat aaatcaacgt
     1561 gttgcgattg gtgtattgct gcatttggac ttacacccag tatagctgac agtataaaaa
     1621 cactattaca acaatattgt ttatatttac acattcaaag tttagcatgt tcatggggaa
     1681 tggttgtgtt actattagta agatataaat gtggaaaaaa tagagaaaca attgaaaaat
     1741 tgctgtctaa actattatgt gtgtctccaa tgtgtatgat gatagagcct ccaaaattgc
     1801 gtagtacagc agcagcatta tattggtata aaacaggtat atcaaatatt agtgaagtgt
     1861 atggagacac gccagaatgg atacaaagac aaacagtatt acaacatagt tttaatgatt
     1921 gtacatttga attatcacag atggtacaat gggcctacga taatgacata gtagacgata
     1981 gtgaaattgc atataaatat gcacaattgg cagacactaa tagtaatgca agtgcctttc
     2041 taaaaagtaa ttcacaggca aaaattgtaa aggattgtgc aacaatgtgt agacattata
     2101 aacgagcaga aaaaaaacaa atgagtatga gtcaatggat aaaatataga tgtgataggg
     2161 tagatgatgg aggtgattgg aagcaaattg ttatgttttt aaggtatcaa ggtgtagagt
     2221 ttatgtcatt tttaactgca ttaaaaagat ttttgcaagg catacctaaa aaaaattgca
     2281 tattactata tggtgcagct aacacaggta aatcattatt tggtatgagt ttaatgaaat
     2341 ttctgcaagg gtctgtaata tgttttgtaa attctaaaag ccatttttgg ttacaaccat
     2401 tagcagatgc caaaataggt atgttagatg atgctacagt gccctgttgg aactacatag
     2461 atgacaattt aagaaatgca ttggatggaa atttagtttc tatggatgta aagcatagac
     2521 cattggtaca actaaaatgc cctccattat taattacatc taacattaat gctggtacag
     2581 attctaggtg gccttattta cataatagat tggtggtgtt tacatttcct aatgagtttc
     2641 catttgacga aaacggaaat ccagtgtatg agcttaatga taagaactgg aaatcctttt
     2701 tctcaaggac gtggtccaga ttaagtttgc acgaggacga ggacaaggaa aacgatggag
     2761 actctttgcc aacgtttaaa tgtgtgtcag gacaaaatac taacacatta tgaaaatgat
     2821 agtacagacc tacgtgacca tatagactat tggaaacaca tgcgcctaga atgtgctatt
     2881 tattacaagg ccagagaaat gggatttaaa catattaacc accaagtggt gccaacactg
     2941 gctgtatcaa agaataaagc attacaagca attgaactgc aactaacgtt agaaacaata
     3001 tataactcac aatatagtaa tgaaaagtgg acattacaag acgttagcct tgaagtgtat
     3061 ttaactgcac caacaggatg tataaaaaaa catggatata cagtggaagt gcagtttgat
     3121 ggagacatat gcaatacaat gcattataca aactggacac atatatatat ttgtgaagaa
     3181 gcatcagtaa ctgtggtaga gggtcaagtt gactattatg gtttatatta tgttcatgaa
     3241 ggaatacgaa catattttgt gcagtttaaa gatgatgcag aaaaatatag taaaaataaa
     3301 gtatgggaag ttcatgcggg tggtcaggta atattatgtc ctacatctgt gtttagcagc
     3361 aacgaagtat cctctcctga aattattagg cagcacttgg ccaaccaccc cgccgcgacc
     3421 cataccaaag ccgtcgcctt gggcaccgaa gaaacacaga cgactatcca gcgaccaaga
     3481 tcagagccag acaccggaaa cccctgccac accactaagt tgttgcacag agactcagtg
     3541 gacagtgctc caatcctcac tgcatttaac agctcacaca aaggacggat taactgtaat
     3601 agtaacacta cacccatagt acatttaaaa ggtgatgcta atactttaaa atgtttaaga
     3661 tatagattta aaaagcattg tacattgtat actgcagtgt cgtctacatg gcattggaca
     3721 ggacataatg taaaacataa aagtgcaatt gttacactta catatgatag tgaatggcaa
     3781 cgtgaccaat ttttgtctca agttaaaata ccaaaaacta ttacagtgtc tactggattt
     3841 atgtctatat gacaaatctt gatactgcat ccacaacatt actggcgtgc tttttgcttt
     3901 gctttgtgtg cttttgtgtg tctgcctatt aatacgtccg ctgcttttgt ctgtgtctac
     3961 atacacatca ttaataatat tggtattact attgtggata acagcagcct ctgcgtttag
     4021 gtgttttatt gtatatatta tatttgttta tataccatta tttttaatac atacacatgc
     4081 acgcttttta attacataat gtatatgtac ataatgtaat tgttacatat aattgttgta
     4141 taccataact tactattttt tcttttttat tttcatatat aatttttttt tttgtttgtt
     4201 tgtttgtttt ttaataaact gttattactt aacaatgcga cacaaacgtt ctgcaaaacg
     4261 cacaaaacgt gcatcggcta cccaacttta taaaacatgc aaacaggcag gtacatgtcc
     4321 acctgacatt atacctaagg ttgaaggcaa aactattgct gaacaaatat tacaatatgg
     4381 aagtatgggt gtattttttg gtgggttagg aattggaaca gggtcgggta caggcggacg
     4441 cactgggtat attccattgg gaacaaggcc tcccacagct acagatacac ttgctcctgt
     4501 aagaccccct ttaacagtag atcctgtggg cccttctgat ccttctatag tttctttagt
     4561 ggaagaaact agttttattg atgctggtgc accaacatct gtaccttcca ttcccccaga
     4621 tgtatcagga tttagtatta ctacttcaac tgataccaca cctgctatat tagatattaa
     4681 taatactgtt actactgtta ctacacataa taatcccact ttcactgacc catctgtatt
     4741 gcagcctcca acacctgcag aaactggagg gcattttaca ctttcatcat ccactattag
     4801 tacacataat tatgaagaaa ttcctatgga tacatttatt gttagcacaa accctaacac
     4861 agtaactagt agcacaccca taccagggtc tcgcccagtg gcacgcctag gattatatag
     4921 tcgcacaaca caacaggtta aagttgtaga ccctgctttt gtaaccactc ccactaaact
     4981 tattacatat gataatcctg catatgaagg tatagatgtg gataatacat tatatttttc
     5041 tagtaatgat aatagtatta atatagctcc agatcctgac tttttggata tagttgcttt
     5101 acataggcca gcattaacct ctaggcgtac tggcattagg tacagtagaa ttggtaataa
     5161 acaaacacta cgtactcgta gtggaaaatc tataggtgct aaggtacatt attattatga
     5221 tttaagtact attgatcctg cagaagaaat agaattacaa actataacac cttctacata
     5281 tactaccact tcacatgcag cctcacctac ttctattaat aatggattat atgatattta
     5341 tgcagatgac tttattacag atacttctac aaccccggta ccatctgtac cctctacatc
     5401 tttatcaggt tatattcctg caaatacaac aattcctttt ggtggtgcat acaatattcc
     5461 tttagtatca ggtcctgata tacccattaa tataactgac caagctcctt cattaattcc
     5521 tatagttcca gggtctccac aatatacaat tattgctgat gcaggtgact tttatttaca
     5581 tcctagttat tacatgttac gaaaacgacg taaacgttta ccatattttt tttcagatgt
     5641 ctctttggct gcctagtgag gccactgtct acttgcctcc tgtcccagta tctaaggttg
     5701 taagcacgga tgaatatgtt gcacgcacaa acatatatta tcatgcagga acatccagac
     5761 tacttgcagt tggacatccc tattttccta ttaaaaaacc taacaataac aaaatattag
     5821 ttcctaaagt atcaggatta caatacaggg tatttagaat acatttacct gaccccaata
     5881 agtttggttt tcctgacacc tcattttata atccagatac acagcggctg gtttgggcct
     5941 gtgtaggtgt tgaggtaggt cgtggtcagc cattaggtgt gggcattagt ggccatcctt
     6001 tattaaataa attggatgac acagaaaatg ctagtgctta tgcagcaaat gcaggtgtgg
     6061 ataatagaga atgtatatct atggattaca aacaaacaca attgtgttta attggttgca
     6121 aaccacctat aggggaacac tggggcaaag gatccccatg taccaatgtt gcagtaaatc
     6181 caggtgattg tccaccatta gagttaataa acacagttat tcaggatggt gatatggttc
     6241 atactggctt tggtgctatg gactttacta cattacaggc taacaaaagt gaagttccac
     6301 tggatatttg tacatctatt tgcaaatatc cagattatat taaaatggtg tcagaaccat
     6361 atggcgacag cttatttttt tatttacgaa gggaacaaat gtttgttaga catttattta
     6421 atagggctgg tactgttggt gaaaatgtac cagacgattt atacattaaa ggctctgggt
     6481 ctactgcaaa tttagccagt tcaaattatt ttcctacacc tagtggttct atggttacct
     6541 ctgatgccca aatattcaat aaaccttatt ggttacaacg agcacagggc cacaataatg
     6601 gcatttgttg gggtaaccaa ctatttgtta ctgttgttga tactacacgc agtacaaata
     6661 tgtcattatg tgctgccata tctacttcag aaactacata taaaaatact aactttaagg
     6721 agtacctacg acatggggag gaatatgatt tacagtttat ttttcaactg tgcaaaataa
     6781 ccttaactgc agacgttatg acatacatac attctatgaa ttccactatt ttggaggact
     6841 ggaattttgg tctacaacct cccccaggag gcacactaga agatacttat aggtttgtaa
     6901 cccaggcaat tgcttgtcaa aaacatacac ctccagcacc taaagaagat gatcccctta
     6961 aaaaatacac tttttgggaa gtaaatttaa aggaaaagtt ttctgcagac ctagatcagt
     7021 ttcctttagg acgcaaattt ttactacaag caggattgaa ggccaaacca aaatttacat
     7081 taggaaaacg aaaagctaca cccaccacct catctacctc tacaactgct aaacgcaaaa
     7141 aacgtaagct gtaagtattg tatgtatgtt gaattagtgt tgtttgttgt gtatatgttt
     7201 gtatgtgctt gtatgtgctt gtaaatatta agttgtatgt gtgtttgtat gtatggtata
     7261 ataaacacgt gtgtatgtgt ttttaaatgc ttgtgtaact attgtgtcat gcaacataaa
     7321 taaacttatt gtttcaacac ctactaattg tgttgtggtt attcattgta tataaactat
     7381 atttgctaca tcctgttttt gttttatata tactatattt tgtagcgcca ggcccatttt
     7441 gtagcttcaa ccgaattcgg ttgcatgctt tttggcacaa aatgtgtttt tttaaatagt
     7501 tctatgtcag caactatggt ttaaacttgt acgtttcctg cttgccatgc gtgccaaatc
     7561 cctgttttcc tgacctgcac tgcttgccaa ccattccatt gttttttaca ctgcactatg
     7621 tgcaactact gaatcactat gtacattgtg tcatataaaa taaatcacta tgcgccaacg
     7681 ccttacatac cgctgttagg cacatatttt tggcttgttt taactaacct aattgcatat
     7741 ttggcataag gtttaaactt ctaaggccaa ctaaatgtca ccctagttca tacatgaact
     7801 gtgtaaaggt tagtcataca ttgttcattt gtaaaactgc acatgggtgt gtgcaaaccg
     7861 attttgggtt acacatttac aagcaactta tataataata ctaa
//
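
The header of the record above can be sanity-checked mechanically. The following is a minimal sketch, not a full GenBank parser: it assumes only simple "start..end" CDS locations as they appear in this entry, and the names RECORD_EXCERPT, parse_cds and parse_base_count are hypothetical helpers introduced for illustration. The excerpt is copied from the LOCUS, FEATURES and BASE COUNT lines of the record.

```python
import re

# Header lines copied from the record above (LOCUS, FEATURES table, BASE COUNT).
RECORD_EXCERPT = """\
LOCUS       PPH16        7904 bp ds-DNA   Circular  VRL       16-APR-1991
FEATURES             Location/Qualifiers
     CDS             83..559
                     /note="E6 (putative)"
     CDS             562..858
                     /note="E7 (putative)"
     CDS             865..1170
                     /note="E1 (putative)"
     CDS             2755..3852
                     /note="E2 (putative)"
     CDS             4235..5656
                     /note="L2 (putative)"
     CDS             5559..7154
                     /note="L1 (putative)"
BASE COUNT     2601 a   1377 c   1509 g   2417 t
"""


def parse_cds(text):
    """Return (name, start, end) for each simple 'start..end' CDS feature."""
    features = []
    pending = None  # location of the CDS line still waiting for its /note
    for line in text.splitlines():
        loc = re.match(r"\s+CDS\s+(\d+)\.\.(\d+)\s*$", line)
        if loc:
            pending = (int(loc.group(1)), int(loc.group(2)))
            continue
        note = re.match(r'\s+/note="(\S+)', line)
        if note and pending:
            features.append((note.group(1), pending[0], pending[1]))
            pending = None
    return features


def parse_base_count(text):
    """Return a dict such as {'a': 2601, ...} from the BASE COUNT line."""
    line = re.search(r"^BASE COUNT\s+(.*)$", text, re.M).group(1)
    return {base: int(n) for n, base in re.findall(r"(\d+)\s+([acgt])", line)}


def locus_length(text):
    """Return the sequence length stated on the LOCUS line."""
    return int(re.search(r"^LOCUS\s+\S+\s+(\d+) bp", text, re.M).group(1))


if __name__ == "__main__":
    for name, start, end in parse_cds(RECORD_EXCERPT):
        length = end - start + 1  # GenBank coordinates are 1-based, inclusive
        print(f"{name}: {start}..{end}  {length} bp  multiple of 3: {length % 3 == 0}")
    counts = parse_base_count(RECORD_EXCERPT)
    print("base count total:", sum(counts.values()),
          "LOCUS length:", locus_length(RECORD_EXCERPT))
```

Run against this entry, the six CDS spans come out at 477, 297, 306, 1098, 1422 and 1596 bp, each a whole number of codons, and the four base counts sum to the 7904 bp given on the LOCUS line. The L2 and L1 spans overlap (5559..5656), which the sketch reports as-is rather than treating as an error.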