GenBank-Updates@genbank.bio.net (05/29/91)
LOCUS       XELHISH3A    8592 bp ds-DNA             VRT       28-MAY-1991
DEFINITION  Xenopus laevis histone gene cluster XlH3-A with genes H1A, H2B,
            H3 and H4
ACCESSION   X03018
KEYWORDS    histone; histone H1A; histone H2A; histone H2B; histone H3;
            histone H4; inverted repeat; tandem repeat.
SOURCE      Xenopus laevis DNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia;
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 8592)
  AUTHORS   Perry,M., Thomsen,G.H. and Roeder,R.G.
  TITLE     Genomic organization and nucleotide sequence of two distinct
            histone gene clusters from Xenopus laevis - Identification of
            novel conserved upstream sequence elements
  JOURNAL   J. Mol. Biol. 185, 479-499 (1985)
  STANDARD  full automatic
COMMENT     SWISS-PROT; P02281; H2B1$XENLA.
            SWISS-PROT; P02304; H4$HUMAN.
            SWISS-PROT; P06892; H1A$XENLA.
            SWISS-PROT; P06897; H2A1$XENLA.
            SWISS-PROT; P16105; H32$BOVIN.
            Histone genes contain a highly conserved region consisting of an
            inverted repeat (FT: INVREP A-E) required for accurate processing
            of the 3'end of the histone mRNA. The alternating
            purine-pyrimidine stretches might be expected in the left-handed
            Z-DNA conformation under conditions of topological stress.
            From EMBL entry XLHISH3A; dated 22-DEC-1988.
FEATURES             Location/Qualifiers
     promoter        522..526
                     /note="CCAAT box (H1A)"
     promoter        546..552
                     /note="TATAA box (H1A)"
     misc_RNA        577..582
                     /note="put. CAP site (H1A)"
     CDS             615..1244
                     /note="histone H1A (aa 1-210)"
                     /codon_start=615
     promoter        1577..1581
                     /note="CCAAT box (H2B)"
     promoter        1623..1629
                     /note="TATAA box (H2B)"
     misc_RNA        1649..1654
                     /note="put. CAP site (H2B)"
     CDS             1695..2072
                     /note="histone H2B (aa 1-126)"
                     /codon_start=1695
     promoter        4275..4281
                     /note="CCAAT box (H2A)"
     promoter        4309..4314
                     /note="TATAA box (H2A)"
     misc_RNA        4344..4349
                     /note="put. CAP site (H2A)"
     CDS             4372..4761
                     /note="histone H2A (aa 1-130)"
                     /codon_start=4372
     CDS             complement(5555..5962)
                     /note="histone H3 (aa 1-136)"
                     /codon_start=5962
     misc_RNA        complement(5989..5994)
                     /note="put. CAP site"
     promoter        complement(6018..6025)
                     /note="TATAA box (H3)"
     promoter        complement(6042..6046)
                     /note="CCAAT box (H3)"
     promoter        6842..6848
                     /note="CCAAT box homologue sequence (H4)"
     promoter        6885..6891
                     /note="TATAA box (H4)"
     misc_RNA        6919..6923
                     /note="put. CAP site (H4)"
     CDS             6949..7257
                     /note="histone H4 (aa 1-103)"
                     /codon_start=6949
BASE COUNT     2226 a   2244 c   2039 g   2083 t
ORIGIN
        1 ccatggtgta acgaatgaca aaacacaatg cacacgagta tgaaggctgc agggactgga
       61 ctgcagcaga ttcgcttctg tgctactttc ctgccagtgc aaagcccgcg gatttctttt
      121 gtagcctcca aagccctcgc tgtcccactg atatcttgcc ccataacctt tcctctctct
      181 ggaataagaa ctggaggaaa tggatggaag tgagcaagtc aaatgaaagc cttaagagaa
      241 tatcccccag atagacagga gtgtgcctta gatatcaggg ctgttttggg acaatcgctg
      301 caaccagtat gcagaaaatc gctgtcagag acaggacgtt tcccaagcag gcgactgtac
      361 aatcactggg aaacgcttgg aagtcgattt tattaataac tttgcttatt gagagcctgg
      421 aagcacagaa tgaaagctcc ctaaaagccc gacacggaca agaaaataat ggcgtgacta
      481 cgctttgtcc aattagaact caattttaca ataaaactga gccaatcaac agacagaaca
      541 ccttgtatat aaggagaagt ggaaagtcca agctccgtgt ttatcttttg taaaagaacg
      601 acagagaatc tgcaatggct gaagccgccg aatccgcgcc cgctcctccc ccggctgagc
      661 ccgcggccaa gaaaaagaaa cagcagccca agaaagcagc agcagcacgg ggggccgcta
      721 aatccaagaa gccctcgtct ggacccagtg tgtccgagca gatcgtcaca gccgtgtccg
      781 cttccaagga gcgcagcggg gtgtctctgg cagcgctcaa gaagactctg gctgcgggag
      841 gctacgatgt ggacaagaac aacagccgcc tcaagctggc tctcaaggtc accaaggaga
      901 ccctgctcca agtcaaaggc agcggagcct ccggttcctt caagctcaac aagaagcagc
      961 tgcagagcaa ggacaaggcc gccgccaaga agaaggcgcc gctagcagcg gaagccaaga
     1021 aaccagcggc agcagccaag aagacagcca agtccccgaa gaagcccaag aaggtctcgg
     1081 cagccgccaa gagcccaaag aagctcaaga aacccgcaaa ggccgccaag agtcccgcta
     1141 aaaagaccgc cgtcaagccc aaagttgctg ccaaaagccc cgcaaaggcc aaagcagcca
     1201 aacccaaagt ggccaaagcc aagaaagccg cccccaagaa gaaatgagca gctcgctcgc
     1261 tcgctcacta tagtggccaa ttcaaccaag gctcttttaa gagccaccac acccccctga
     1321 aagagcttac aacttcccgc gtctcctgct tctaccacaa gtctccctac ataccgtaat
     1381 attttctcac taacaccact acaagttccc acatgtagtc gtttagtggt gccggccgcc
     1441 gcgagtacca actcggctta aacattctat tcggagggag gagggcggag agagttaatg
     1501 gacgttgccg ggaagcttta cttaccacca atcgtctgga gaaaagctgc gctttgacgt
     1561 catgccacag agccagccaa tgggaatcag tgtcacggcg ccagtgcttt acatgggcag
     1621 ggtataaaag cagctgcagc cggagcagca cttcatcgtt tgctttatag actcatcctg
     1681 tctagttgct gaacatgcct gagcccgcca aatccgctcc agccccaaag aagggctcca
     1741 aaaaagccgt cactaaaacc cagaagaagg atggcaagaa gcgtaggaag agcaggaaag
     1801 agagctacgc catctacgtg tacaaagtgc tcaagcaggt gcaccccgat accggcatct
     1861 cttccaaggc catgagcatc atgaactcct ttgttaacga tgtcttcgag cgcatcgcag
     1921 gggaagcctc ccgcctggct cactacaaca agcgctccac catcacctcc cgggagatcc
     1981 agaccgcggt ccgcctgctc ttgcctgggg agctggccaa gcacgccgtg tccgagggca
     2041 ccaaggccgt caccaagtac accagcgcca agtaatctct ctctccccat tccctgcccc
     2101 acaaacccaa aggctctttt cagagccacc cacctcctct gtacaagggc tgcacctagc
     2161 ttccactttc atccagagtc gcttagtatt tacattcaac ttctatctag aagatttaca
     2221 aacacccttc tgtgaagagg ctttcaggcc acggtctact agtttaacgt ctgaagcctt
     2281 ccttactccg tgcagtattt gcctaaaaac aggatttggc tcttttcatc actagaacaa
     2341 aacgaccaca acgcttctgc acggttttca cttgatggct ttacgtcacc gttgtttcag
     2401 tccacgcatt agaacacaac agatgaggca ccgacacact ccaagcctgc acttgtgatc
     2461 cttaccagcg cggcccattc agcttcttgg ggcaaagaga acacgctgct gattccactg
     2521 agcccgccca aaatgaagaa gctcctttat aaacccgtac agaattggct aaatccattg
     2581 gtagcctttt aaaccatgac atggccacag agtagttcca aggaaaccat ttcaaagcca
     2641 cccctgctgc tggctgatga cgttttggaa actacgctag aagtcaaagg agcccttctt
     2701 caacctgctt gctccattcc tctgccctgt gtatgttttt cacgtacata tttctttgta
     2761 acacacacac acacacacac acacacacac acacacacac acacacacac acacacatga
     2821 atcagttcct cccttaggtt aacccttagc actgcctagt cctaggccgg agttctagtt
     2881 ggatcagttt tgccattaat ccgctttctg gagggctccg ggtcggattt cctttggcgt
     2941 cagatgactg aagtggagta taagcgggtg gtctctacct gctctctagt tgtgctccat
     3001 gtaacacaat ggttgttcac cttccaaaca ccttttccaa tctagttttt ttcacattcc
     3061 tcaccagaaa caaagacttt cttcaattac cttctatttt ttattgtttt tctaaagttg
     3121 aagtttaaaa acgtgaatgt ccctggcctt tcagtctggc agctcagtta ttcaggcgca
     3181 gattctgaac tgttacaact ttgcaacatt tgggtataac aattggttga tgcaaatttc
     3241 agcaacatcg ctggtgaatt agcaactatt gtatcaattc tgactgctgc ctgtaatcaa
     3301 ggaaactcag ggattctgcg ctgcagggac aaacagaaga aatgtatcaa tttagaaagg
     3361 agtctaaatt agagtcagcg accccctcct ttcatagcta ctttagaatg taaaaaatgt
     3421 aaactttaca cttccgttct atttgtgatg tttttctaat attggctgca aaaagtcttc
     3481 atttctagtg aagcatctga aaacaactaa actggaaata gtttggaagg tgacgaaccc
     3541 tttttaatgc tttagagacg agacttaggt ttatagatat gtacgaaata caagagatca
     3601 tataggggta cagccttttc aagaaataga aaagggccgt tttgatcaga atacagaagc
     3661 gagggtttta acctcttcct cgctctcatc cccgttgctc tacagattta cttgacctga
     3721 tagggaactg caaggactgt ctccacccct tattccctag cgtttcaaag gaatcacttg
     3781 gcggcttccg attggctggc gtgtgatttt gaacctgaac acgattgccc gccaaaagga
     3841 ttgcgttaac tggctacagt acagtaacat cacagctttg tccccagcat tgcgccatga
     3901 aattcgatcc tactcaatcg cctccgctgg gatagacttt gtgcaggttt catgtattta
     3961 tctatttatc tggtacatcc tttgcagcaa aatcgctcca cttctccttt actcctttcc
     4021 ccaccgcccc actaataggc gcaacccaac gatcctctca gctgagctcg gtgttcctct
     4081 ggccgtttgt tacttctcta aagatttcca gtaatgagcg gataaaagta gcgctttcca
     4141 cattgttaca ctgggacatt ttcgccgcgt tactcgagcg agatacaact cgcaacaagg
     4201 tttcccgcaa ttcagcgcgc tccccccccc ccccccccca aagcttgtgt ttcgttggct
     4261 gcgaggaagc ctaaccaatc ggcagagaga aggactctcg ctgctgacta taaaaagaaa
     4321 gtaccgctac actataggcg aatcattttg tctaacgaag tcgtttgaag catgtcagga
     4381 agaggcaaac aaggcggaaa gacccgggcc aaggccaaga ctcgctcatc tcgggccggc
     4441 cttcagttcc cggtcggccg tgttcaccgg ctcttgagaa aaggcaatta tgccgagcgg
     4501 gtgggagccg gagctccagt ctaccttgcc gcggtgctcg agtatctgac cgccgagatc
     4561 ctcgagttgg ccggcaacgc tgcccgggat aacaagaaga ctcggatcat ccccaggcac
     4621 ctgcagctcg ccgtgcgcaa cgacgaggag ctcaacaaac tgctcggagg ggtcaccatc
     4681 gcccagggcg gtgtcctgcc caacatccag tctgtgctgc tccccaagaa aaccgagagc
     4741 gccaaatctg ccaagagcaa gtgagctctt ccctccaaaa atactactgc cctgaccact
     4801 cacccaaagg ctcttttcag agccacccac ttcatctaaa cagcgctgta tagccccgtg
     4861 tgtgtgtgtg tgtgtgtgtg tgtgtgtcta aatagtgaga ataaattgtg acacataaga
     4921 gggtttccct tgtacacttt ttcctcactt gattgccagg gaaatgatgt ggtgggagac
     4981 gggcagctat tttgccgaga ttactgcccc ctaatacgag atttttgtga tgagggaaga
     5041 gttggtctag ggactccgaa actccaatct tctcagtgtg agcaatgaac gcgtgtgtgt
     5101 gtgtgtatgc gcaggagaaa aacaaattaa taaaaccgtc tgaaaagtgt gtgattactc
     5161 tcgggctgtg aagtgaaggc cactcatgat cattgtcgta caagtccatt caactaggac
     5221 gttttgcctg agccgcgcga agtgcttgtg cccgattgaa gcagcacaaa agccaggctg
     5281 tcgggtcacc cggtgctgct agagcttggc ttcttttcgg aggattaggt agaaagtgaa
     5341 tgaggaaccc ccctccgcct cgattaaggg aacgaaggat ttgcaccttg gccgcctggt
     5401 aattgtgtgt tgatatttgt acatggggga agatgtgtga agccctgata acagtggagc
     5461 acagctgatt gtcagacagt gttggtggct ctgaaaaaag agccttttgt gtgagggagt
     5521 gagcgagcga tggagacggg gaggagtgag tctaagccct ctctcctcgg atcctgcggg
     5581 ccagctggat gtccttgggc atgatggtga ctctcttggc gtggatggcg cacaggttgg
     5641 tgtcctcgaa gagccccacc agataagcct cgctggcctc ctgcagagcc atgacggctg
     5701 agctctggaa gcggagatcg gtcttgaagt cctgggcgat ctcccgaact aaacgctgga
     5761 aaggcagttt gcgaatgagc agctcggtgg atttctggta acggcgaatt tcccggagag
     5821 ccacggtgcc tgggcgatag cgatggggtt tcttgactcc gccggtggcc ggggcgctct
     5881 tgcgggctgc cttggtggcc aactgcttgc ggggagcctt cccgccggtg gatttgcggg
     5941 cggtctgctt ggtacgagcc atgtcaacag aaagcttttc actaactcca ataccagcct
     6001 cacggccgat ttctcccttt tattagaagc gccttaatac cattggctct ttttttaacc
     6061 acacccctcc ttcttcatct ttgattggtt cttagccgtt tcaaactgcc cgctaaaagt
     6121 tacagatttt aagcgaggta ctttttcctc atacaaaacc tttttttttt tttctttttt
     6181 ctttctctgc atactcattt cctgaagtcg acttgaaatt ccgaagatcg accgggtttc
     6241 ctcgttattt actttccgga atagaaacgg ccgctgtatg ggggctggga ggtgttactg
     6301 atttcacacg gtggcagcgc tgagagctaa tgcgaacgtc agatctttca atcccttcac
     6361 aatacaaaga aatggccagt tagcagatga gttccagccc cagccacacc tagtagaata
     6421 cttggggacg aacatgttgg gcaggggaac gccaagtgct cgtataggct gaagctgcct
     6481 gatcaacgac aagttacaca cagacgaatg tgatcatagt gactgtcact tgggcaccag
     6541 ttaaatggac tatatagtaa aagactcaat cgccatatta tgcccccctt cccccttccc
     6601 ccttccccct tcccccttcc ccccccaata acatactttg tgcgctttac agagcgaatg
     6661 tcccagtgta acaactgtcg agtaggtatc atatttacaa attagcacga tagatttggc
     6721 gggcacttta caatgatggg aaggtttgac gctgaaaaga cgttttagga atgactactg
     6781 gaggttgacg tttatccaat cacgagttaa cacatcgaac tcgtagaagc cgatgggaaa
     6841 ggggcggggt gtgcttgctg ttccctatca gtcagcccag tgtgtatatg atttagggtc
     6901 tgagcgcctg gactcacaca ttcgataagg actttagtga gtgaaatcat gtctggaaga
     6961 ggcaagggcg gaaagggtct gggcaaagga ggcgccaaac gtcacaggaa ggtgctgcgg
     7021 gataacatcc agggcatcac caagcccgcc atccgccgcc tcgcacgcag agggggagtc
     7081 aagcgcatct ccggcctcat ctacgaggag actcgcgggg tgctcaaagt tttcctggag
     7141 aacgttatcc gggacgccgt cacctacacc gagcacgcca agaggaagac cgttaccgct
     7201 atggatgtgg tgtatgctct gaagcgccaa ggacgcactc tgtacggatt cggaggttaa
     7261 ggctcgctaa ggttttttat atttttcccc tcatcaaaat aacggccctt ttaagggcca
     7321 cccacctatt ccttcaaaag gctgcatccg tctatttaga tctattgttt gaagttgtat
     7381 accgtaatat cgaaatcttt cttattaacc gcttgtacaa gagtcggtgc cgcccgcatg
     7441 tctctacaca gtggctgcta tagtaccgag tggatgggga aataaggctg ctttgctcat
     7501 aatgagggag gagaggccaa tcgagcctcc ttacaatcaa atgctatagc tgatgttcta
     7561 gcggcggttt tgtagcagct caggagccag agaaacagtg gactgacacg tgatcgctac
     7621 aatacaggcg gcggcggcgg cggcggctaa aaacctgctc tttgtgctgc tgctgctgct
     7681 gctgctgctg ctgctgctgc tacaggacac ggtcatttgg ggaagggaag tgtggatctg
     7741 ggatgtgtag aatcgcaaga acgttacaat gcatggatga ttttaccagt gcgatttcta
     7801 ctcactgaca ataaaatata cacggtgtag ccataggcat tcgcggtgtc attttggcgg
     7861 cctgggagat aaaaagcggc ttctcgcaac tcaaaggaca aagggacttg ttgtttctct
     7921 tcctgctccg ccctcacacg gtaaggacgg gaactgcagc cagacgctcc ccactagcag
     7981 acaagtctcc tggatactac actgcaataa cttactgttg cacagttcag caactattca
     8041 ttgtcattcc atccatcaac tatgaagagg catagacata gcagctgtaa aataagcacc
     8101 tcctatgaaa taatgcaaag aactagaagt tctgtgccag caatccctta taaactctaa
     8161 tctttataca ccacaaggta gagctctatc agacattcat gaggcctata tgggattata
     8221 ctaatctctt gaccacttct aatatactat cctgttatga ccctgtggtg tttatacaac
     8281 tggcctgcag atgtcactgt gctcaagact tatactgggc atactttcta tacagcttgt
     8341 ggtgttagag ttgcctcctg ttgtgtaatg cctgcatgag tctgctaata aagcactgtt
     8401 atactctact catccttggc taccaaaaca taacaaattg aggatgagat ggctcttcta
     8461 cctctgcagc aggctgcaga ttttcttcca aaccctggtg agcctaccat acctttaact
     8521 gcttggatcc gtatgtttca aaattacgtt attgctgctg accaagggga gatttctgct
     8581 gctagaaagc tt
//
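
The CDS spans in the FEATURES table can be cross-checked against the residue counts given in their /note fields, since each location is 1-based and inclusive. The Python sketch below is only an illustration and not part of the GenBank entry: the locations, notes, BASE COUNT values and LOCUS length are copied from the record above, while the helper names (parse_location, codon_count, check_cds_lengths, reverse_complement) are hypothetical, and only the simple "a..b" and "complement(a..b)" location forms used in this entry are handled.

```python
import re

# CDS locations and notes copied from the FEATURES table of the record.
CDS_FEATURES = [
    ("615..1244", "histone H1A (aa 1-210)"),
    ("1695..2072", "histone H2B (aa 1-126)"),
    ("4372..4761", "histone H2A (aa 1-130)"),
    ("complement(5555..5962)", "histone H3 (aa 1-136)"),
    ("6949..7257", "histone H4 (aa 1-103)"),
]

# BASE COUNT values and LOCUS length copied from the record.
BASE_COUNT = {"a": 2226, "c": 2244, "g": 2039, "t": 2083}
LOCUS_LENGTH = 8592

# Assumption: only plain and complement() spans occur (true for this entry).
LOCATION_RE = re.compile(r"^(complement\()?(\d+)\.\.(\d+)\)?$")
AA_RANGE_RE = re.compile(r"\(aa 1-(\d+)\)")
COMPLEMENT = str.maketrans("acgt", "tgca")


def parse_location(location):
    """Return (start, end, strand) for 'a..b' or 'complement(a..b)'."""
    match = LOCATION_RE.match(location)
    if match is None:
        raise ValueError(f"unsupported location: {location!r}")
    strand = "-" if match.group(1) else "+"
    return int(match.group(2)), int(match.group(3)), strand


def codon_count(location):
    """Number of whole codons covered by a 1-based, inclusive span."""
    start, end, _ = parse_location(location)
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError(f"span {location!r} is not a multiple of 3")
    return length // 3


def check_cds_lengths():
    """Return (note, computed codons, annotated residues) per CDS."""
    rows = []
    for location, note in CDS_FEATURES:
        annotated = int(AA_RANGE_RE.search(note).group(1))
        rows.append((note, codon_count(location), annotated))
    return rows


def reverse_complement(sequence):
    """Reverse-complement a lowercase a/c/g/t string."""
    return sequence.translate(COMPLEMENT)[::-1]


if __name__ == "__main__":
    for note, computed, annotated in check_cds_lengths():
        print(f"{note}: {computed} codons computed, {annotated} annotated")
    print("base count total:", sum(BASE_COUNT.values()), "of", LOCUS_LENGTH)
```

For the complement-strand H3 gene, bases 5960..5962 read "cat" on the strand as given, so reverse_complement("cat") yields the "atg" start codon at the 5962 end of the span.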