GenBank-Updates@genbank.bio.net (05/29/91)
LOCUS XELHISH3A 8592 bp ds-DNA VRT 28-MAY-1991
DEFINITION Xenopus laevis histone gene cluster XlH3-A with genes H1A, H2B, H3
and H4
ACCESSION X03018
KEYWORDS histone; histone H1A; histone H2A; histone H2B; histone H3;
histone H4; inverted repeat; tandem repeat.
SOURCE Xenopus laevis DNA.
ORGANISM Xenopus laevis
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE 1 (bases 1 to 8592)
AUTHORS Perry,M., Thomsen,G.H. and Roeder,R.G.
TITLE Genomic organization and nucleotide sequence of two distinct
histone gene clusters from Xenopus laevis - Identification of novel
conserved upstream sequence elements
JOURNAL J. Mol. Biol. 185, 479-499 (1985)
STANDARD full automatic
COMMENT SWISS-PROT; P02281; H2B1$XENLA. SWISS-PROT; P02304; H4$HUMAN.
SWISS-PROT; P06892; H1A$XENLA. SWISS-PROT; P06897; H2A1$XENLA.
SWISS-PROT; P16105; H32$BOVIN.
Histone genes contain a highly conserved region consisting of an
inverted repeat (FT: INVREP A-E) required for accurate processing
of the 3'end of the histone mRNA. The alternating purine-pyrimidine
stretches might be expected in the left-handed Z-DNA conformation
under conditions of topological stress.
From EMBL entry XLHISH3A; dated 22-DEC-1988.
FEATURES Location/Qualifiers
promoter 522..526
/note="CCAAT box (H1A)"
promoter 546..552
/note="TATAA box (H1A)"
misc_RNA 577..582
/note="put. CAP site (H1A)"
CDS 615..1244
/note="histone H1A (aa 1-210)"
/codon_start=615
promoter 1577..1581
/note="CCAAT box (H2B)"
promoter 1623..1629
/note="TATAA box (H2B)"
misc_RNA 1649..1654
/note="put. CAP site (H2B)"
CDS 1695..2072
/note="histone H2B (aa 1-126)"
/codon_start=1695
promoter 4275..4281
/note="CCAAT box (H2A)"
promoter 4309..4314
/note="TATAA box (H2A)"
misc_RNA 4344..4349
/note="put. CAP site (H2A)"
CDS 4372..4761
/note="histone H2A (aa 1-130)"
/codon_start=4372
CDS complement(5555..5962)
/note="histone H3 (aa 1-136)"
/codon_start=5962
misc_RNA complement(5989..5994)
/note="put. CAP site"
promoter complement(6018..6025)
/note="TATAA box (H3)"
promoter complement(6042..6046)
/note="CCAAT box (H3)"
promoter 6842..6848
/note="CCAAT box homologue sequence (H4)"
promoter 6885..6891
/note="TATAA box (H4)"
misc_RNA 6919..6923
/note="put. CAP site (H4)"
CDS 6949..7257
/note="histone H4 (aa 1-103)"
/codon_start=6949
BASE COUNT 2226 a 2244 c 2039 g 2083 t
ORIGIN
1 ccatggtgta acgaatgaca aaacacaatg cacacgagta tgaaggctgc agggactgga
61 ctgcagcaga ttcgcttctg tgctactttc ctgccagtgc aaagcccgcg gatttctttt
121 gtagcctcca aagccctcgc tgtcccactg atatcttgcc ccataacctt tcctctctct
181 ggaataagaa ctggaggaaa tggatggaag tgagcaagtc aaatgaaagc cttaagagaa
241 tatcccccag atagacagga gtgtgcctta gatatcaggg ctgttttggg acaatcgctg
301 caaccagtat gcagaaaatc gctgtcagag acaggacgtt tcccaagcag gcgactgtac
361 aatcactggg aaacgcttgg aagtcgattt tattaataac tttgcttatt gagagcctgg
421 aagcacagaa tgaaagctcc ctaaaagccc gacacggaca agaaaataat ggcgtgacta
481 cgctttgtcc aattagaact caattttaca ataaaactga gccaatcaac agacagaaca
541 ccttgtatat aaggagaagt ggaaagtcca agctccgtgt ttatcttttg taaaagaacg
601 acagagaatc tgcaatggct gaagccgccg aatccgcgcc cgctcctccc ccggctgagc
661 ccgcggccaa gaaaaagaaa cagcagccca agaaagcagc agcagcacgg ggggccgcta
721 aatccaagaa gccctcgtct ggacccagtg tgtccgagca gatcgtcaca gccgtgtccg
781 cttccaagga gcgcagcggg gtgtctctgg cagcgctcaa gaagactctg gctgcgggag
841 gctacgatgt ggacaagaac aacagccgcc tcaagctggc tctcaaggtc accaaggaga
901 ccctgctcca agtcaaaggc agcggagcct ccggttcctt caagctcaac aagaagcagc
961 tgcagagcaa ggacaaggcc gccgccaaga agaaggcgcc gctagcagcg gaagccaaga
1021 aaccagcggc agcagccaag aagacagcca agtccccgaa gaagcccaag aaggtctcgg
1081 cagccgccaa gagcccaaag aagctcaaga aacccgcaaa ggccgccaag agtcccgcta
1141 aaaagaccgc cgtcaagccc aaagttgctg ccaaaagccc cgcaaaggcc aaagcagcca
1201 aacccaaagt ggccaaagcc aagaaagccg cccccaagaa gaaatgagca gctcgctcgc
1261 tcgctcacta tagtggccaa ttcaaccaag gctcttttaa gagccaccac acccccctga
1321 aagagcttac aacttcccgc gtctcctgct tctaccacaa gtctccctac ataccgtaat
1381 attttctcac taacaccact acaagttccc acatgtagtc gtttagtggt gccggccgcc
1441 gcgagtacca actcggctta aacattctat tcggagggag gagggcggag agagttaatg
1501 gacgttgccg ggaagcttta cttaccacca atcgtctgga gaaaagctgc gctttgacgt
1561 catgccacag agccagccaa tgggaatcag tgtcacggcg ccagtgcttt acatgggcag
1621 ggtataaaag cagctgcagc cggagcagca cttcatcgtt tgctttatag actcatcctg
1681 tctagttgct gaacatgcct gagcccgcca aatccgctcc agccccaaag aagggctcca
1741 aaaaagccgt cactaaaacc cagaagaagg atggcaagaa gcgtaggaag agcaggaaag
1801 agagctacgc catctacgtg tacaaagtgc tcaagcaggt gcaccccgat accggcatct
1861 cttccaaggc catgagcatc atgaactcct ttgttaacga tgtcttcgag cgcatcgcag
1921 gggaagcctc ccgcctggct cactacaaca agcgctccac catcacctcc cgggagatcc
1981 agaccgcggt ccgcctgctc ttgcctgggg agctggccaa gcacgccgtg tccgagggca
2041 ccaaggccgt caccaagtac accagcgcca agtaatctct ctctccccat tccctgcccc
2101 acaaacccaa aggctctttt cagagccacc cacctcctct gtacaagggc tgcacctagc
2161 ttccactttc atccagagtc gcttagtatt tacattcaac ttctatctag aagatttaca
2221 aacacccttc tgtgaagagg ctttcaggcc acggtctact agtttaacgt ctgaagcctt
2281 ccttactccg tgcagtattt gcctaaaaac aggatttggc tcttttcatc actagaacaa
2341 aacgaccaca acgcttctgc acggttttca cttgatggct ttacgtcacc gttgtttcag
2401 tccacgcatt agaacacaac agatgaggca ccgacacact ccaagcctgc acttgtgatc
2461 cttaccagcg cggcccattc agcttcttgg ggcaaagaga acacgctgct gattccactg
2521 agcccgccca aaatgaagaa gctcctttat aaacccgtac agaattggct aaatccattg
2581 gtagcctttt aaaccatgac atggccacag agtagttcca aggaaaccat ttcaaagcca
2641 cccctgctgc tggctgatga cgttttggaa actacgctag aagtcaaagg agcccttctt
2701 caacctgctt gctccattcc tctgccctgt gtatgttttt cacgtacata tttctttgta
2761 acacacacac acacacacac acacacacac acacacacac acacacacac acacacatga
2821 atcagttcct cccttaggtt aacccttagc actgcctagt cctaggccgg agttctagtt
2881 ggatcagttt tgccattaat ccgctttctg gagggctccg ggtcggattt cctttggcgt
2941 cagatgactg aagtggagta taagcgggtg gtctctacct gctctctagt tgtgctccat
3001 gtaacacaat ggttgttcac cttccaaaca ccttttccaa tctagttttt ttcacattcc
3061 tcaccagaaa caaagacttt cttcaattac cttctatttt ttattgtttt tctaaagttg
3121 aagtttaaaa acgtgaatgt ccctggcctt tcagtctggc agctcagtta ttcaggcgca
3181 gattctgaac tgttacaact ttgcaacatt tgggtataac aattggttga tgcaaatttc
3241 agcaacatcg ctggtgaatt agcaactatt gtatcaattc tgactgctgc ctgtaatcaa
3301 ggaaactcag ggattctgcg ctgcagggac aaacagaaga aatgtatcaa tttagaaagg
3361 agtctaaatt agagtcagcg accccctcct ttcatagcta ctttagaatg taaaaaatgt
3421 aaactttaca cttccgttct atttgtgatg tttttctaat attggctgca aaaagtcttc
3481 atttctagtg aagcatctga aaacaactaa actggaaata gtttggaagg tgacgaaccc
3541 tttttaatgc tttagagacg agacttaggt ttatagatat gtacgaaata caagagatca
3601 tataggggta cagccttttc aagaaataga aaagggccgt tttgatcaga atacagaagc
3661 gagggtttta acctcttcct cgctctcatc cccgttgctc tacagattta cttgacctga
3721 tagggaactg caaggactgt ctccacccct tattccctag cgtttcaaag gaatcacttg
3781 gcggcttccg attggctggc gtgtgatttt gaacctgaac acgattgccc gccaaaagga
3841 ttgcgttaac tggctacagt acagtaacat cacagctttg tccccagcat tgcgccatga
3901 aattcgatcc tactcaatcg cctccgctgg gatagacttt gtgcaggttt catgtattta
3961 tctatttatc tggtacatcc tttgcagcaa aatcgctcca cttctccttt actcctttcc
4021 ccaccgcccc actaataggc gcaacccaac gatcctctca gctgagctcg gtgttcctct
4081 ggccgtttgt tacttctcta aagatttcca gtaatgagcg gataaaagta gcgctttcca
4141 cattgttaca ctgggacatt ttcgccgcgt tactcgagcg agatacaact cgcaacaagg
4201 tttcccgcaa ttcagcgcgc tccccccccc ccccccccca aagcttgtgt ttcgttggct
4261 gcgaggaagc ctaaccaatc ggcagagaga aggactctcg ctgctgacta taaaaagaaa
4321 gtaccgctac actataggcg aatcattttg tctaacgaag tcgtttgaag catgtcagga
4381 agaggcaaac aaggcggaaa gacccgggcc aaggccaaga ctcgctcatc tcgggccggc
4441 cttcagttcc cggtcggccg tgttcaccgg ctcttgagaa aaggcaatta tgccgagcgg
4501 gtgggagccg gagctccagt ctaccttgcc gcggtgctcg agtatctgac cgccgagatc
4561 ctcgagttgg ccggcaacgc tgcccgggat aacaagaaga ctcggatcat ccccaggcac
4621 ctgcagctcg ccgtgcgcaa cgacgaggag ctcaacaaac tgctcggagg ggtcaccatc
4681 gcccagggcg gtgtcctgcc caacatccag tctgtgctgc tccccaagaa aaccgagagc
4741 gccaaatctg ccaagagcaa gtgagctctt ccctccaaaa atactactgc cctgaccact
4801 cacccaaagg ctcttttcag agccacccac ttcatctaaa cagcgctgta tagccccgtg
4861 tgtgtgtgtg tgtgtgtgtg tgtgtgtcta aatagtgaga ataaattgtg acacataaga
4921 gggtttccct tgtacacttt ttcctcactt gattgccagg gaaatgatgt ggtgggagac
4981 gggcagctat tttgccgaga ttactgcccc ctaatacgag atttttgtga tgagggaaga
5041 gttggtctag ggactccgaa actccaatct tctcagtgtg agcaatgaac gcgtgtgtgt
5101 gtgtgtatgc gcaggagaaa aacaaattaa taaaaccgtc tgaaaagtgt gtgattactc
5161 tcgggctgtg aagtgaaggc cactcatgat cattgtcgta caagtccatt caactaggac
5221 gttttgcctg agccgcgcga agtgcttgtg cccgattgaa gcagcacaaa agccaggctg
5281 tcgggtcacc cggtgctgct agagcttggc ttcttttcgg aggattaggt agaaagtgaa
5341 tgaggaaccc ccctccgcct cgattaaggg aacgaaggat ttgcaccttg gccgcctggt
5401 aattgtgtgt tgatatttgt acatggggga agatgtgtga agccctgata acagtggagc
5461 acagctgatt gtcagacagt gttggtggct ctgaaaaaag agccttttgt gtgagggagt
5521 gagcgagcga tggagacggg gaggagtgag tctaagccct ctctcctcgg atcctgcggg
5581 ccagctggat gtccttgggc atgatggtga ctctcttggc gtggatggcg cacaggttgg
5641 tgtcctcgaa gagccccacc agataagcct cgctggcctc ctgcagagcc atgacggctg
5701 agctctggaa gcggagatcg gtcttgaagt cctgggcgat ctcccgaact aaacgctgga
5761 aaggcagttt gcgaatgagc agctcggtgg atttctggta acggcgaatt tcccggagag
5821 ccacggtgcc tgggcgatag cgatggggtt tcttgactcc gccggtggcc ggggcgctct
5881 tgcgggctgc cttggtggcc aactgcttgc ggggagcctt cccgccggtg gatttgcggg
5941 cggtctgctt ggtacgagcc atgtcaacag aaagcttttc actaactcca ataccagcct
6001 cacggccgat ttctcccttt tattagaagc gccttaatac cattggctct ttttttaacc
6061 acacccctcc ttcttcatct ttgattggtt cttagccgtt tcaaactgcc cgctaaaagt
6121 tacagatttt aagcgaggta ctttttcctc atacaaaacc tttttttttt tttctttttt
6181 ctttctctgc atactcattt cctgaagtcg acttgaaatt ccgaagatcg accgggtttc
6241 ctcgttattt actttccgga atagaaacgg ccgctgtatg ggggctggga ggtgttactg
6301 atttcacacg gtggcagcgc tgagagctaa tgcgaacgtc agatctttca atcccttcac
6361 aatacaaaga aatggccagt tagcagatga gttccagccc cagccacacc tagtagaata
6421 cttggggacg aacatgttgg gcaggggaac gccaagtgct cgtataggct gaagctgcct
6481 gatcaacgac aagttacaca cagacgaatg tgatcatagt gactgtcact tgggcaccag
6541 ttaaatggac tatatagtaa aagactcaat cgccatatta tgcccccctt cccccttccc
6601 ccttccccct tcccccttcc ccccccaata acatactttg tgcgctttac agagcgaatg
6661 tcccagtgta acaactgtcg agtaggtatc atatttacaa attagcacga tagatttggc
6721 gggcacttta caatgatggg aaggtttgac gctgaaaaga cgttttagga atgactactg
6781 gaggttgacg tttatccaat cacgagttaa cacatcgaac tcgtagaagc cgatgggaaa
6841 ggggcggggt gtgcttgctg ttccctatca gtcagcccag tgtgtatatg atttagggtc
6901 tgagcgcctg gactcacaca ttcgataagg actttagtga gtgaaatcat gtctggaaga
6961 ggcaagggcg gaaagggtct gggcaaagga ggcgccaaac gtcacaggaa ggtgctgcgg
7021 gataacatcc agggcatcac caagcccgcc atccgccgcc tcgcacgcag agggggagtc
7081 aagcgcatct ccggcctcat ctacgaggag actcgcgggg tgctcaaagt tttcctggag
7141 aacgttatcc gggacgccgt cacctacacc gagcacgcca agaggaagac cgttaccgct
7201 atggatgtgg tgtatgctct gaagcgccaa ggacgcactc tgtacggatt cggaggttaa
7261 ggctcgctaa ggttttttat atttttcccc tcatcaaaat aacggccctt ttaagggcca
7321 cccacctatt ccttcaaaag gctgcatccg tctatttaga tctattgttt gaagttgtat
7381 accgtaatat cgaaatcttt cttattaacc gcttgtacaa gagtcggtgc cgcccgcatg
7441 tctctacaca gtggctgcta tagtaccgag tggatgggga aataaggctg ctttgctcat
7501 aatgagggag gagaggccaa tcgagcctcc ttacaatcaa atgctatagc tgatgttcta
7561 gcggcggttt tgtagcagct caggagccag agaaacagtg gactgacacg tgatcgctac
7621 aatacaggcg gcggcggcgg cggcggctaa aaacctgctc tttgtgctgc tgctgctgct
7681 gctgctgctg ctgctgctgc tacaggacac ggtcatttgg ggaagggaag tgtggatctg
7741 ggatgtgtag aatcgcaaga acgttacaat gcatggatga ttttaccagt gcgatttcta
7801 ctcactgaca ataaaatata cacggtgtag ccataggcat tcgcggtgtc attttggcgg
7861 cctgggagat aaaaagcggc ttctcgcaac tcaaaggaca aagggacttg ttgtttctct
7921 tcctgctccg ccctcacacg gtaaggacgg gaactgcagc cagacgctcc ccactagcag
7981 acaagtctcc tggatactac actgcaataa cttactgttg cacagttcag caactattca
8041 ttgtcattcc atccatcaac tatgaagagg catagacata gcagctgtaa aataagcacc
8101 tcctatgaaa taatgcaaag aactagaagt tctgtgccag caatccctta taaactctaa
8161 tctttataca ccacaaggta gagctctatc agacattcat gaggcctata tgggattata
8221 ctaatctctt gaccacttct aatatactat cctgttatga ccctgtggtg tttatacaac
8281 tggcctgcag atgtcactgt gctcaagact tatactgggc atactttcta tacagcttgt
8341 ggtgttagag ttgcctcctg ttgtgtaatg cctgcatgag tctgctaata aagcactgtt
8401 atactctact catccttggc taccaaaaca taacaaattg aggatgagat ggctcttcta
8461 cctctgcagc aggctgcaga ttttcttcca aaccctggtg agcctaccat acctttaact
8521 gcttggatcc gtatgtttca aaattacgtt attgctgctg accaagggga gatttctgct
8581 gctagaaagc tt
//