[bionet.molbio.genbank.updates] Xenopus laevis histone gene cluster XlH3-A with genes H1A, H2B, H3 and H4

GenBank-Updates@genbank.bio.net (05/29/91)

LOCUS       XELHISH3A    8592 bp ds-DNA             VRT       28-MAY-1991
DEFINITION  Xenopus laevis histone gene cluster XlH3-A with genes H1A, H2B, H3
            and H4
ACCESSION   X03018
KEYWORDS    histone; histone H1A; histone H2A; histone H2B; histone H3;
            histone H4; inverted repeat; tandem repeat.
SOURCE      Xenopus laevis DNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 8592)
  AUTHORS   Perry,M., Thomsen,G.H. and Roeder,R.G.
  TITLE     Genomic organization and nucleotide sequence of two distinct
            histone gene clusters from Xenopus laevis - Identification of novel
            conserved upstream sequence elements
  JOURNAL   J. Mol. Biol. 185, 479-499 (1985)
  STANDARD  full automatic
COMMENT     SWISS-PROT; P02281; H2B1$XENLA. SWISS-PROT; P02304; H4$HUMAN.
            SWISS-PROT; P06892; H1A$XENLA. SWISS-PROT; P06897; H2A1$XENLA.
            SWISS-PROT; P16105; H32$BOVIN.
            
            Histone genes contain a highly conserved region consisting of an
            inverted repeat (FT: INVREP A-E) required for accurate processing
            of the 3' end of the histone mRNA. The alternating
            purine-pyrimidine stretches might be expected to adopt the
            left-handed Z-DNA conformation under conditions of topological
            stress.
            
            From EMBL    entry XLHISH3A;  dated 22-DEC-1988.
FEATURES             Location/Qualifiers
     promoter        522..526
                     /note="CCAAT box (H1A)"
     promoter        546..552
                     /note="TATAA box (H1A)"
     misc_RNA        577..582
                     /note="put. CAP site (H1A)"
     CDS             615..1244
                     /note="histone H1A (aa 1-210)"
                     /codon_start=615
     promoter        1577..1581
                     /note="CCAAT box (H2B)"
     promoter        1623..1629
                     /note="TATAA box (H2B)"
     misc_RNA        1649..1654
                     /note="put. CAP site (H2B)"
     CDS             1695..2072
                     /note="histone H2B (aa 1-126)"
                     /codon_start=1695
     promoter        4275..4281
                     /note="CCAAT box (H2A)"
     promoter        4309..4314
                     /note="TATAA box (H2A)"
     misc_RNA        4344..4349
                     /note="put. CAP site (H2A)"
     CDS             4372..4761
                     /note="histone H2A (aa 1-130)"
                     /codon_start=4372
     CDS             complement(5555..5962)
                     /note="histone H3 (aa 1-136)"
                     /codon_start=5962
     misc_RNA        complement(5989..5994)
                     /note="put. CAP site"
     promoter        complement(6018..6025)
                     /note="TATAA box (H3)"
     promoter        complement(6042..6046)
                     /note="CCAAT box (H3)"
     promoter        6842..6848
                     /note="CCAAT box homologue sequence (H4)"
     promoter        6885..6891
                     /note="TATAA box (H4)"
     misc_RNA        6919..6923
                     /note="put. CAP site (H4)"
     CDS             6949..7257
                     /note="histone H4 (aa 1-103)"
                     /codon_start=6949
BASE COUNT     2226 a   2244 c   2039 g   2083 t
ORIGIN
        1 ccatggtgta acgaatgaca aaacacaatg cacacgagta tgaaggctgc agggactgga
       61 ctgcagcaga ttcgcttctg tgctactttc ctgccagtgc aaagcccgcg gatttctttt
      121 gtagcctcca aagccctcgc tgtcccactg atatcttgcc ccataacctt tcctctctct
      181 ggaataagaa ctggaggaaa tggatggaag tgagcaagtc aaatgaaagc cttaagagaa
      241 tatcccccag atagacagga gtgtgcctta gatatcaggg ctgttttggg acaatcgctg
      301 caaccagtat gcagaaaatc gctgtcagag acaggacgtt tcccaagcag gcgactgtac
      361 aatcactggg aaacgcttgg aagtcgattt tattaataac tttgcttatt gagagcctgg
      421 aagcacagaa tgaaagctcc ctaaaagccc gacacggaca agaaaataat ggcgtgacta
      481 cgctttgtcc aattagaact caattttaca ataaaactga gccaatcaac agacagaaca
      541 ccttgtatat aaggagaagt ggaaagtcca agctccgtgt ttatcttttg taaaagaacg
      601 acagagaatc tgcaatggct gaagccgccg aatccgcgcc cgctcctccc ccggctgagc
      661 ccgcggccaa gaaaaagaaa cagcagccca agaaagcagc agcagcacgg ggggccgcta
      721 aatccaagaa gccctcgtct ggacccagtg tgtccgagca gatcgtcaca gccgtgtccg
      781 cttccaagga gcgcagcggg gtgtctctgg cagcgctcaa gaagactctg gctgcgggag
      841 gctacgatgt ggacaagaac aacagccgcc tcaagctggc tctcaaggtc accaaggaga
      901 ccctgctcca agtcaaaggc agcggagcct ccggttcctt caagctcaac aagaagcagc
      961 tgcagagcaa ggacaaggcc gccgccaaga agaaggcgcc gctagcagcg gaagccaaga
     1021 aaccagcggc agcagccaag aagacagcca agtccccgaa gaagcccaag aaggtctcgg
     1081 cagccgccaa gagcccaaag aagctcaaga aacccgcaaa ggccgccaag agtcccgcta
     1141 aaaagaccgc cgtcaagccc aaagttgctg ccaaaagccc cgcaaaggcc aaagcagcca
     1201 aacccaaagt ggccaaagcc aagaaagccg cccccaagaa gaaatgagca gctcgctcgc
     1261 tcgctcacta tagtggccaa ttcaaccaag gctcttttaa gagccaccac acccccctga
     1321 aagagcttac aacttcccgc gtctcctgct tctaccacaa gtctccctac ataccgtaat
     1381 attttctcac taacaccact acaagttccc acatgtagtc gtttagtggt gccggccgcc
     1441 gcgagtacca actcggctta aacattctat tcggagggag gagggcggag agagttaatg
     1501 gacgttgccg ggaagcttta cttaccacca atcgtctgga gaaaagctgc gctttgacgt
     1561 catgccacag agccagccaa tgggaatcag tgtcacggcg ccagtgcttt acatgggcag
     1621 ggtataaaag cagctgcagc cggagcagca cttcatcgtt tgctttatag actcatcctg
     1681 tctagttgct gaacatgcct gagcccgcca aatccgctcc agccccaaag aagggctcca
     1741 aaaaagccgt cactaaaacc cagaagaagg atggcaagaa gcgtaggaag agcaggaaag
     1801 agagctacgc catctacgtg tacaaagtgc tcaagcaggt gcaccccgat accggcatct
     1861 cttccaaggc catgagcatc atgaactcct ttgttaacga tgtcttcgag cgcatcgcag
     1921 gggaagcctc ccgcctggct cactacaaca agcgctccac catcacctcc cgggagatcc
     1981 agaccgcggt ccgcctgctc ttgcctgggg agctggccaa gcacgccgtg tccgagggca
     2041 ccaaggccgt caccaagtac accagcgcca agtaatctct ctctccccat tccctgcccc
     2101 acaaacccaa aggctctttt cagagccacc cacctcctct gtacaagggc tgcacctagc
     2161 ttccactttc atccagagtc gcttagtatt tacattcaac ttctatctag aagatttaca
     2221 aacacccttc tgtgaagagg ctttcaggcc acggtctact agtttaacgt ctgaagcctt
     2281 ccttactccg tgcagtattt gcctaaaaac aggatttggc tcttttcatc actagaacaa
     2341 aacgaccaca acgcttctgc acggttttca cttgatggct ttacgtcacc gttgtttcag
     2401 tccacgcatt agaacacaac agatgaggca ccgacacact ccaagcctgc acttgtgatc
     2461 cttaccagcg cggcccattc agcttcttgg ggcaaagaga acacgctgct gattccactg
     2521 agcccgccca aaatgaagaa gctcctttat aaacccgtac agaattggct aaatccattg
     2581 gtagcctttt aaaccatgac atggccacag agtagttcca aggaaaccat ttcaaagcca
     2641 cccctgctgc tggctgatga cgttttggaa actacgctag aagtcaaagg agcccttctt
     2701 caacctgctt gctccattcc tctgccctgt gtatgttttt cacgtacata tttctttgta
     2761 acacacacac acacacacac acacacacac acacacacac acacacacac acacacatga
     2821 atcagttcct cccttaggtt aacccttagc actgcctagt cctaggccgg agttctagtt
     2881 ggatcagttt tgccattaat ccgctttctg gagggctccg ggtcggattt cctttggcgt
     2941 cagatgactg aagtggagta taagcgggtg gtctctacct gctctctagt tgtgctccat
     3001 gtaacacaat ggttgttcac cttccaaaca ccttttccaa tctagttttt ttcacattcc
     3061 tcaccagaaa caaagacttt cttcaattac cttctatttt ttattgtttt tctaaagttg
     3121 aagtttaaaa acgtgaatgt ccctggcctt tcagtctggc agctcagtta ttcaggcgca
     3181 gattctgaac tgttacaact ttgcaacatt tgggtataac aattggttga tgcaaatttc
     3241 agcaacatcg ctggtgaatt agcaactatt gtatcaattc tgactgctgc ctgtaatcaa
     3301 ggaaactcag ggattctgcg ctgcagggac aaacagaaga aatgtatcaa tttagaaagg
     3361 agtctaaatt agagtcagcg accccctcct ttcatagcta ctttagaatg taaaaaatgt
     3421 aaactttaca cttccgttct atttgtgatg tttttctaat attggctgca aaaagtcttc
     3481 atttctagtg aagcatctga aaacaactaa actggaaata gtttggaagg tgacgaaccc
     3541 tttttaatgc tttagagacg agacttaggt ttatagatat gtacgaaata caagagatca
     3601 tataggggta cagccttttc aagaaataga aaagggccgt tttgatcaga atacagaagc
     3661 gagggtttta acctcttcct cgctctcatc cccgttgctc tacagattta cttgacctga
     3721 tagggaactg caaggactgt ctccacccct tattccctag cgtttcaaag gaatcacttg
     3781 gcggcttccg attggctggc gtgtgatttt gaacctgaac acgattgccc gccaaaagga
     3841 ttgcgttaac tggctacagt acagtaacat cacagctttg tccccagcat tgcgccatga
     3901 aattcgatcc tactcaatcg cctccgctgg gatagacttt gtgcaggttt catgtattta
     3961 tctatttatc tggtacatcc tttgcagcaa aatcgctcca cttctccttt actcctttcc
     4021 ccaccgcccc actaataggc gcaacccaac gatcctctca gctgagctcg gtgttcctct
     4081 ggccgtttgt tacttctcta aagatttcca gtaatgagcg gataaaagta gcgctttcca
     4141 cattgttaca ctgggacatt ttcgccgcgt tactcgagcg agatacaact cgcaacaagg
     4201 tttcccgcaa ttcagcgcgc tccccccccc ccccccccca aagcttgtgt ttcgttggct
     4261 gcgaggaagc ctaaccaatc ggcagagaga aggactctcg ctgctgacta taaaaagaaa
     4321 gtaccgctac actataggcg aatcattttg tctaacgaag tcgtttgaag catgtcagga
     4381 agaggcaaac aaggcggaaa gacccgggcc aaggccaaga ctcgctcatc tcgggccggc
     4441 cttcagttcc cggtcggccg tgttcaccgg ctcttgagaa aaggcaatta tgccgagcgg
     4501 gtgggagccg gagctccagt ctaccttgcc gcggtgctcg agtatctgac cgccgagatc
     4561 ctcgagttgg ccggcaacgc tgcccgggat aacaagaaga ctcggatcat ccccaggcac
     4621 ctgcagctcg ccgtgcgcaa cgacgaggag ctcaacaaac tgctcggagg ggtcaccatc
     4681 gcccagggcg gtgtcctgcc caacatccag tctgtgctgc tccccaagaa aaccgagagc
     4741 gccaaatctg ccaagagcaa gtgagctctt ccctccaaaa atactactgc cctgaccact
     4801 cacccaaagg ctcttttcag agccacccac ttcatctaaa cagcgctgta tagccccgtg
     4861 tgtgtgtgtg tgtgtgtgtg tgtgtgtcta aatagtgaga ataaattgtg acacataaga
     4921 gggtttccct tgtacacttt ttcctcactt gattgccagg gaaatgatgt ggtgggagac
     4981 gggcagctat tttgccgaga ttactgcccc ctaatacgag atttttgtga tgagggaaga
     5041 gttggtctag ggactccgaa actccaatct tctcagtgtg agcaatgaac gcgtgtgtgt
     5101 gtgtgtatgc gcaggagaaa aacaaattaa taaaaccgtc tgaaaagtgt gtgattactc
     5161 tcgggctgtg aagtgaaggc cactcatgat cattgtcgta caagtccatt caactaggac
     5221 gttttgcctg agccgcgcga agtgcttgtg cccgattgaa gcagcacaaa agccaggctg
     5281 tcgggtcacc cggtgctgct agagcttggc ttcttttcgg aggattaggt agaaagtgaa
     5341 tgaggaaccc ccctccgcct cgattaaggg aacgaaggat ttgcaccttg gccgcctggt
     5401 aattgtgtgt tgatatttgt acatggggga agatgtgtga agccctgata acagtggagc
     5461 acagctgatt gtcagacagt gttggtggct ctgaaaaaag agccttttgt gtgagggagt
     5521 gagcgagcga tggagacggg gaggagtgag tctaagccct ctctcctcgg atcctgcggg
     5581 ccagctggat gtccttgggc atgatggtga ctctcttggc gtggatggcg cacaggttgg
     5641 tgtcctcgaa gagccccacc agataagcct cgctggcctc ctgcagagcc atgacggctg
     5701 agctctggaa gcggagatcg gtcttgaagt cctgggcgat ctcccgaact aaacgctgga
     5761 aaggcagttt gcgaatgagc agctcggtgg atttctggta acggcgaatt tcccggagag
     5821 ccacggtgcc tgggcgatag cgatggggtt tcttgactcc gccggtggcc ggggcgctct
     5881 tgcgggctgc cttggtggcc aactgcttgc ggggagcctt cccgccggtg gatttgcggg
     5941 cggtctgctt ggtacgagcc atgtcaacag aaagcttttc actaactcca ataccagcct
     6001 cacggccgat ttctcccttt tattagaagc gccttaatac cattggctct ttttttaacc
     6061 acacccctcc ttcttcatct ttgattggtt cttagccgtt tcaaactgcc cgctaaaagt
     6121 tacagatttt aagcgaggta ctttttcctc atacaaaacc tttttttttt tttctttttt
     6181 ctttctctgc atactcattt cctgaagtcg acttgaaatt ccgaagatcg accgggtttc
     6241 ctcgttattt actttccgga atagaaacgg ccgctgtatg ggggctggga ggtgttactg
     6301 atttcacacg gtggcagcgc tgagagctaa tgcgaacgtc agatctttca atcccttcac
     6361 aatacaaaga aatggccagt tagcagatga gttccagccc cagccacacc tagtagaata
     6421 cttggggacg aacatgttgg gcaggggaac gccaagtgct cgtataggct gaagctgcct
     6481 gatcaacgac aagttacaca cagacgaatg tgatcatagt gactgtcact tgggcaccag
     6541 ttaaatggac tatatagtaa aagactcaat cgccatatta tgcccccctt cccccttccc
     6601 ccttccccct tcccccttcc ccccccaata acatactttg tgcgctttac agagcgaatg
     6661 tcccagtgta acaactgtcg agtaggtatc atatttacaa attagcacga tagatttggc
     6721 gggcacttta caatgatggg aaggtttgac gctgaaaaga cgttttagga atgactactg
     6781 gaggttgacg tttatccaat cacgagttaa cacatcgaac tcgtagaagc cgatgggaaa
     6841 ggggcggggt gtgcttgctg ttccctatca gtcagcccag tgtgtatatg atttagggtc
     6901 tgagcgcctg gactcacaca ttcgataagg actttagtga gtgaaatcat gtctggaaga
     6961 ggcaagggcg gaaagggtct gggcaaagga ggcgccaaac gtcacaggaa ggtgctgcgg
     7021 gataacatcc agggcatcac caagcccgcc atccgccgcc tcgcacgcag agggggagtc
     7081 aagcgcatct ccggcctcat ctacgaggag actcgcgggg tgctcaaagt tttcctggag
     7141 aacgttatcc gggacgccgt cacctacacc gagcacgcca agaggaagac cgttaccgct
     7201 atggatgtgg tgtatgctct gaagcgccaa ggacgcactc tgtacggatt cggaggttaa
     7261 ggctcgctaa ggttttttat atttttcccc tcatcaaaat aacggccctt ttaagggcca
     7321 cccacctatt ccttcaaaag gctgcatccg tctatttaga tctattgttt gaagttgtat
     7381 accgtaatat cgaaatcttt cttattaacc gcttgtacaa gagtcggtgc cgcccgcatg
     7441 tctctacaca gtggctgcta tagtaccgag tggatgggga aataaggctg ctttgctcat
     7501 aatgagggag gagaggccaa tcgagcctcc ttacaatcaa atgctatagc tgatgttcta
     7561 gcggcggttt tgtagcagct caggagccag agaaacagtg gactgacacg tgatcgctac
     7621 aatacaggcg gcggcggcgg cggcggctaa aaacctgctc tttgtgctgc tgctgctgct
     7681 gctgctgctg ctgctgctgc tacaggacac ggtcatttgg ggaagggaag tgtggatctg
     7741 ggatgtgtag aatcgcaaga acgttacaat gcatggatga ttttaccagt gcgatttcta
     7801 ctcactgaca ataaaatata cacggtgtag ccataggcat tcgcggtgtc attttggcgg
     7861 cctgggagat aaaaagcggc ttctcgcaac tcaaaggaca aagggacttg ttgtttctct
     7921 tcctgctccg ccctcacacg gtaaggacgg gaactgcagc cagacgctcc ccactagcag
     7981 acaagtctcc tggatactac actgcaataa cttactgttg cacagttcag caactattca
     8041 ttgtcattcc atccatcaac tatgaagagg catagacata gcagctgtaa aataagcacc
     8101 tcctatgaaa taatgcaaag aactagaagt tctgtgccag caatccctta taaactctaa
     8161 tctttataca ccacaaggta gagctctatc agacattcat gaggcctata tgggattata
     8221 ctaatctctt gaccacttct aatatactat cctgttatga ccctgtggtg tttatacaac
     8281 tggcctgcag atgtcactgt gctcaagact tatactgggc atactttcta tacagcttgt
     8341 ggtgttagag ttgcctcctg ttgtgtaatg cctgcatgag tctgctaata aagcactgtt
     8401 atactctact catccttggc taccaaaaca taacaaattg aggatgagat ggctcttcta
     8461 cctctgcagc aggctgcaga ttttcttcca aaccctggtg agcctaccat acctttaact
     8521 gcttggatcc gtatgtttca aaattacgtt attgctgctg accaagggga gatttctgct
     8581 gctagaaagc tt
//
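
The CDS spans in the feature table can be cross-checked against the amino-acid counts given in their /note qualifiers. The following is a minimal sketch in Python, not part of the original entry. It assumes the CDS lines are copied verbatim from the table above into a string; the names FEATURES, CDS_RE and cds_summary are hypothetical and chosen only for illustration. It also confirms that the BASE COUNT figures sum to the 8592 bp stated on the LOCUS line.

```python
import re

# CDS entries copied from the feature table above (location and /note only).
FEATURES = """\
     CDS             615..1244
                     /note="histone H1A (aa 1-210)"
     CDS             1695..2072
                     /note="histone H2B (aa 1-126)"
     CDS             4372..4761
                     /note="histone H2A (aa 1-130)"
     CDS             complement(5555..5962)
                     /note="histone H3 (aa 1-136)"
     CDS             6949..7257
                     /note="histone H4 (aa 1-103)"
"""

# Matches a CDS location line followed by its /note line.
CDS_RE = re.compile(
    r'^\s+CDS\s+(complement\()?(\d+)\.\.(\d+)\)?\s*\n'
    r'\s+/note="histone (\S+) \(aa 1-(\d+)\)"',
    re.MULTILINE,
)


def cds_summary(text):
    """Return (gene, strand, length_bp, codons, annotated_aa) per CDS."""
    rows = []
    for m in CDS_RE.finditer(text):
        strand = "-" if m.group(1) else "+"
        start, end = int(m.group(2)), int(m.group(3))
        length = end - start + 1  # locations are 1-based and inclusive
        rows.append((m.group(4), strand, length, length // 3, int(m.group(5))))
    return rows


# Base composition from the BASE COUNT line of the entry.
BASE_COUNT = {"a": 2226, "c": 2244, "g": 2039, "t": 2083}

if __name__ == "__main__":
    for gene, strand, length, codons, aa in cds_summary(FEATURES):
        print(f"{gene:4s} strand {strand}  {length:4d} bp  {codons:3d} codons  aa 1-{aa}")
    print("total bases:", sum(BASE_COUNT.values()))
```

In this entry each CDS length is a multiple of three and the codon count equals the annotated residue count, so the listed spans appear to exclude the stop codon.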