[bionet.molbio.genbank.updates] Rabbit beta1-globin gene

GenBank-Updates@genbank.bio.net (05/09/91)

LOCUS       RABHBBB1B1   7547 bp ds-DNA             MAM       09-MAY-1991
DEFINITION  Rabbit beta1-globin gene (allele 2), complete cds and L1 repetitive
            sequences.
ACCESSION   K03256 M12603
KEYWORDS    L1 repetitive sequence; beta-1-globin; beta-globin; globin;
            pseudogene.
SEGMENT     1 of 2
SOURCE      Rabbit DNA [Mol. Cell. Biol. 5, 147-160 (1985)], clones L1Oc-[4,5]
            [Mol. Biol. Evol. 3, 179-190 (1986)].
  ORGANISM  Oryctolagus cuniculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE   1  (bases 1 to 4161)
  AUTHORS   Rohrbaugh,M.L., Johnson,J.E., James,M.D. and Hardison,R.C.
  TITLE     Transcription unit of the rabbit beta1 globin gene
  JOURNAL   Mol. Cell. Biol. 5, 147-160 (1985)
  STANDARD  full staff_review
REFERENCE   2  (bases 3067 to 5556)
  AUTHORS   Demers,G.W., Brech,K. and Hardison,R.C.
  JOURNAL   Unpublished (1986) Pennsylvania State U. University Park, PA 16802
  STANDARD  full staff_review
REFERENCE   3  (bases 4258 to 7547)
  AUTHORS   Demers,G.W., Brech,K. and Hardison,R.C.
  TITLE     Long interspersed L1 repeats in rabbit DNA are homologous to L1
            repeats of rodents and primates in an open-reading-frame region
  JOURNAL   Mol. Biol. Evol. 3, 179-190 (1986)
  STANDARD  full staff_review
COMMENT
            [Unpublished (1986) Pennsylvania State U. University Park, PA
            16802]  revises [Mol. Biol. Evol. 3, 179-190 (1986)].
            
            Draft entry and sequence in computer readable form for [Unpublished
            (1986) Pennsylvania State U. University Park, PA 16802] kindly
            provided by M.Rohrbaugh, 16-AUG-1985.  Draft entry and sequence in
            computer readable form for [Mol. Biol. Evol. 3, 179-190 (1986)]
            kindly provided by R.C.Hardison,
            15-JUL-1986.
            
            A 'CAAT' box is present at position 350-358, a 'TATA' box is found
            at position 396-401.  A polyadenylation signal is located at
            position 1690-1695.
            
            The putative ancestral reading frame of the L10c repeats ended with
            the stop codon at position 4479-4477 (on comp strand) and continued
            beyond the end of this repeat element [Mol. Biol. Evol. 3, 179-190
            (1986)].  The RNA transcribed
            from the strand opposite the B1 gene (FL mRNA in sites) has been
            shown to be transcribed in fetal liver nuclei.  The 3' end of this
            transcript has not yet been determined.  An 'atg' codon is present
            at position 2168-2170 (on comp strand).
FEATURES             Location/Qualifiers
     mRNA            complement(<2245..2245)
                     /note="FL mRNA"
     CDS             join(480..571,698..920,1494..1622)
                     /note="beta-1 globin"
                     /codon_start=480
     mRNA            427..1714
                     /note="b1-g mRNA"
     exon            480..571
                     /note="beta-1 globin, exon 1"
     intron          572..697
                     /note="b1-g intron A"
     exon            698..920
                     /note="beta-1 globin, exon 2"
     intron          921..1493
                     /note="b1-g intron B"
     exon            1494..1622
                     /note="beta-1 globin, exon 3"
     repeat_region   complement(3234..7547)
                     /note="L10c copy 4 [Mol. Biol. Evol. 3, 179-190 (1986)],
                     [Unpublished (1986) Pennsylvania State U. University Park,
                     PA 16802]"
     misc_feature    5556..5557
                     /note="deletion in L10c copy 4"
BASE COUNT     1861 a   1488 c   1513 g   2685 t
ORIGIN      1 bp upstream of BglII site.
        1 agatctctct ctctctctct ctctctctct ctacctatct atttatctat ttaagtggat
       61 ttcaacacac aaatcttctc ccttttctgt gccttaaatc ctcatttgta tgataaataa
      121 ttgcagagaa aatttttcat aggcttacca ggctctaata acaaaaatta tataaataaa
      181 tttggcaaga aaggtgtttt cagtagcaat tagtactgct ggtatgggtc tgggagatac
      241 atagaaggaa ggctgagtct gtcagactcc taagccattg ccataactgc caaggacagg
      301 ggtgctgtca tcacccagac ctcaccctgc agagccacac cctggtgttg gccaatctac
      361 acacggggta gggattacat agttcaggac ttgggcataa aaggcagagc agggcagctg
      421 ctgcttacac ttgcttttga cacaactgtg tttacttgca atcccccaaa acagacagaa
      481 tggtgcatct gtccagtgag gagaagtctg cggtcactgc cctgtggggc aaggtgaatg
      541 tggaagaagt tggtggtgag gccctgggca ggttggtatc ctttttacag cacaacttaa
      601 tgagacagat agaaactggt cttgtagaaa cagagtagtc gcctgctttt ctgccaggtg
      661 ctgacttctc tcccctgggc tgttttcatt ttctcaggct gctggttgtc tacccatgga
      721 cccagaggtt cttcgagtcc tttggggacc tgtcctctgc acatgctgtt atgagcaatc
      781 ctaaggtgaa ggctcatggc aagaaggtgc tggctgcctt cagtgagggt ctgaatcacc
      841 tggacaacct caaaggcacc tttgctaagc tgagtgaact gcactgtgac aagctgcacg
      901 tggatcctga gaacttcagg gtgagtttgg ggacccttga ttgttctttc tttttcgcta
      961 ttgtaaaatt catgttatat ggagggggca aagttttcag ggtgttgttt agaatgggaa
     1021 gatgtccctt gtatcaccat ggaccctcat gataattttg tttctttcac tttctactct
     1081 gttgacaacc attgtctcct cttattttct tttcattttc tgtaactttt tcgttaaact
     1141 ttagcttgca tttgtaacga atttttaaat tcacttttgt ttatttgtca gattgtaagt
     1201 actttctcta atcacttttt tttcaaggca atcagggtat attatattgt acttcagcac
     1261 agttttagag aacaattgtt ataattaaat gataaggtag aatatttctg catataaatt
     1321 ctggctggcg tggaaatatt cttattggta gaaacaacta catcctggtc atcatcctgc
     1381 ctttctcttt atggttacaa tgatatacac tgtttgagat gaggataaaa tactctgagt
     1441 ccaaaccggg cccctctgct aaccatgttc atgccttctt ctttttccta cagctcctgg
     1501 gcaacgtgct ggttgttgtg ctgtctcatc attttggcaa agaattcact cctcaggtgc
     1561 aggctgccta tcagaaggtg gtggctggtg tggccaatgc cctggctcac aaataccact
     1621 gagatctttt tccctctgcc aaaaattatg gggacatcat gaagcccctt gagcatctga
     1681 cttctggcta ataaaggaaa tttattttca ttgcaatagt gtgttggaat tttttgtgtc
     1741 tctcactcgg aaggacatat gggagggcaa atcatttaaa acatcagaat gagtatttgg
     1801 tttagagttt ggcaacatat gccatatgct ggctgccatg aacaaaggtg gctataaaga
     1861 ggtcatcagt atatgaaaca gccccctgct gtccattcct tattccatag aaaagccttg
     1921 acttgaggtt agattttttt tatattttgt tttgtgttat ttttttcttt aacatcccta
     1981 aaattttcct tacatgtttt actagccaga tttttcctcc tctcctgact actcccagtc
     2041 atagctgtcc ctcttctctt atgaagatct tattaaagca gctgggacag ggacagaaaa
     2101 agggctttga ctgcctttct cttgagccct tttcctgatc tccacaactc actgatacca
     2161 ctggtctcat tggaaggggt gggctgttaa cagtgtgaca aatgtaggaa taaactggat
     2221 gcaaaagggg gctttgtgca gctttatatt cactgttgtc ttaaaccctt tttatggact
     2281 caaatcaaat gacagtccct caggatgtta gcttctgaat tcagaaagtg attgcagagt
     2341 tgcccactcc tttatcctgt gtctgatggt tttgctgtct ctgtagtgat tagcttatgt
     2401 caccatttcc tcattcaata ggcactaggt ggatgaaagg ttctggttca ctccccaaat
     2461 acctgcaaca gtcaggagtg tgtcaggcca aaaccagaaa acaggaattg ccatggggtc
     2521 tccatgatgg gtggcaggga ctcaagtaca tgagccatat tcggctgctt ccaggtacat
     2581 tagcagaaaa ctagatcaga agtggagctg tggggaccag aataaacact ttgatatggg
     2641 atgttggtgt ctcaagtagc aacttaaccc cctgctcact aaaacactct aatcctcatt
     2701 acctaggagc aactgagcct gagggctatc taatatagct ggtgacacag agatcatata
     2761 ccctggctaa aagcatggct gaatccatga aagaaaatat atgctcaaaa taggaataga
     2821 atacacagat ttatgcacag atgcttacaa attttagcca atcctgatga catggttaac
     2881 ttggagatct agatcagttc ttgccagcat gcccagagaa tagtacatgg gaaaatttat
     2941 agagatgatg agttagagac aaagtgagtg ataatgacat tgcctgggat tgctgctagg
     3001 tacactgaaa aatcagggag gaagatccaa taaatgaccc attcaaaatc tagaaaacct
     3061 gtcaacagga actttggaaa cttatttcta atgtatctga acatcaaggc agcaataagt
     3121 ctttctgtaa aatcattaaa tatgcccaaa tgtcaagttc tatgtgagtc atgaaggtaa
     3181 cttgataatg ctctacactt catattttgt tcattgttta atacaaaacg caatttttat
     3241 tttatttatt taatttttaa ctgtttattt aataaatata aatttccaaa ttacagctta
     3301 tagattacaa tggcttcatc ctcataactt gccttgccaa cctgcaaccc tcccatctcc
     3361 tgctccctct cccattccat tcacatcaag attcattttc aattatcttt atatacagaa
     3421 gatcaattta gtatatatta agtaaagatt ttaacagttt gcacccacac agaacataaa
     3481 gtataaatac tgtttgagta ctagttatag cattaattca cattgaacaa cacattaagg
     3541 acagagatcc tacatgagga gtaagtgcac agcgactcct gtcgttgact taacaaattg
     3601 acattcttgt ttagggggtc agttatctcc ccaggctcct gtcatgagtt accaaggcta
     3661 tggaggcctt ttgagttcac tgacttcgat cttatttaga caaggtcata gtgaaagtgg
     3721 aagtccactc ctccctttag agaacggtac ctccttcctc aatggcccat tctttcaact
     3781 gggatctcgc tcacagagat ctttcattta gctcatttaa ctcctttttt tttttttttt
     3841 tctagagcat cttacctttc cattgcctga aatactttca tgggctcttc agccagatgt
     3901 gaatgcctta agggctgatt ctgaggccag agtgctgttt aggacatgtg ccattctatg
     3961 agtctgatgt gtatcccatt tcccatgttg gaatgttctc tccattttta attctgtcag
     4021 ttagtattag cagacactag tcttgtttat gtgatccctc tgactcttat gcctatcatt
     4081 acgatcaatt gtgaacagaa attgatcact gggactagtg agatggcatt ggaacatggc
     4141 cacctcaatg ggattgaatt cgtaatcccc tggtctgttt ctaactctac catttgaggt
     4201 aagtcagttt gagcatgtcc cgaattgcac atctcttccc tctcttattc ccactcttat
     4261 atttaacagg gattactttt cagttaaatt taaacaccta agaataattg tgtgttaatt
     4321 acagagttca accaatagta ttaagtagaa caaccaaaaa atactaaaag ggataaagta
     4381 ttacattgta catcaacagt caggacaagg gctgttcaag tcactgtttc tcatagtgtt
     4441 catttcactt tgacaggttt cctttttggt gctgggtcag ttgtcactga tcagggagaa
     4501 catatgatat ttgtcccttt gggactggct tatttcactc agcatgatgt gttccagatt
     4561 cctccatttt gttgcaaatg accggatttc attgtttttt tttgcttcta tatagtattc
     4621 tatagagtac atgtcccata atttcttcct ccagtctact gttgatgggc atttgggttg
     4681 gttccaggtc ttagctattg tgaagtgagc cgcaataaac attgaggtgc agacagcttg
     4741 tttgtttgcc aatttaattt cctttgggta aattccagga gcgggatggc tgggttgtat
     4801 ggtagggtta tattcaggtt tctgaggatc tccagactga cttccatagg ggcttaacca
     4861 gtttgcattc ccaccaacag tgggttagtg tccctttctc cccacatcct ttccagcatc
     4921 tattgttggt agatttctgt atgtgagcca ttctaagcgg ggtgaggtga aacctcattg
     4981 tggttttgat ttgcatttcc ctgattgcta gcgatcttga acatttcttc atgtggatgt
     5041 tggccatttg gatttcctct tttcaaaaat ggcaagtgag gtccttggcc catctcttaa
     5101 gtgggttgtt tgttttgatg ctgtggagtt tctttatgtc tttgtggatt ctagctatta
     5161 atgctttatc tgttgcttag tttgcaaata ttttttccca ttctgtcagt tgcctcttca
     5221 cttcctgact gcttcttttg cagtacagaa cttctcaatt tgatgtaatc tcaatagtta
     5281 attttggctt tgactgcctg tgcctccagg gtcttttcca agaagtcttt gcggtgccaa
     5341 tatcttgcag ggtttctcca atgttctcta ataacttcat ggtgtcgggt catagattta
     5401 ggtctttaat ccatgttgag tggatttttg tgtaaggtgt aaggtagggg tctttcttca
     5461 tgcttcagca cgtggaaatc ccagcaccat ttattgaata gactgtcctt gctccaggaa
     5521 ttggttttag attcctgatc aaatataagt aggctgttgt atcccttcaa tttctttttc
     5581 ttgcctaaca gctctggcta aagcctccag aaatatactg aatagcagtg gtgagaatgg
     5641 atatccctgt atggtaccag atctcagtgg aaatgcttcc aactttttcc cattcaatag
     5701 gatgctggtc gtgggttttt cataaattgc tttgattgta ttgaggaaca ttccttctat
     5761 acccagttta cttagagttt tcaccatgaa agggtcttgt gttttattga atgctttctc
     5821 tgcatctatt gagataatca tatggttttt cttctgcagt ctgttaatgt ggtgtatcac
     5881 atttgcaaac acttgaacca tccctgcata ccagggttat atcccacttg gtctgggtga
     5941 atgatctttc tgaaatgttg ttgcactccg ttggccagaa ttttattgag aatttttgag
     6001 tctatgttca ttaggtatat tgttctgtaa ttttctttca atgctgcatc tttttccggc
     6061 ttaggaatta aggtgatgct ggattcatag aaagattttg ggaggattcc ctctttttca
     6121 attgttctga atagtttgag aagaattgag ttagttcttc tttaaatttc tggtagaatt
     6181 cagtagtgaa tccatctggt cctgggcttt tctttgttgg gagggccttt attactgttt
     6241 caatttctgc ctcagttatg ggtttgttta ggctttcgat gtcttcctgg ttcaatgtag
     6301 gtaggttgca ggtgtccagg aatctatgca tttctgatag atttccctgt ttgctggcat
     6361 acagtccttg tagtaatttc tgatgattct tttcatttct gtggtgtctg ctgttacatt
     6421 tcctatttca tctctgattt tattgatttg gtctcttctt cttttagtta gttgagctaa
     6481 tgcggtatca attttgttta ttttttcaaa aaaccagctc cccatttggc tgatttttgg
     6541 taattttttt ggattcaatc ctgttgattt cttctctgat tttaattatt tctcttctcc
     6601 tactagattt gggtctgctt tgctgcagtt tttctagatc cttgaggtga tttgaaagct
     6661 catctatttg gtgcctttcc aatttcttga tgtaggcacc tattgatata aacttttctc
     6721 ttaacactgc tttcgctgca tctcatacat tttggtatgt tgtgctgtta tcctcattta
     6781 cttccagaaa gtttttgatt tctcttttga tttctttgat gacctagtgt tcattcagga
     6841 gcatgttgtt cactctccat gtgtttgcat atgctgtagg gattcctgag ttgctaattt
     6901 ccgacttcat tctattatgg tctgagaagc tgcatcatat gattctaatt cttttgaatg
     6961 tgctgagact tgctttatgg cctagtatgt ggttaatctt agagtaggtt ccatgtactg
     7021 ctgagaagaa tgtaaattct ttaagtgcag gatgaaaagt tctgtagata tgtgtcagat
     7081 ccatctgggc tatagtatcc tttgaatgta ctgtttcctg tagtcttctg tcctgtgatc
     7141 tgtctatttc tgagagtgga gtattgaagt cccccagtac tattgtattg gagtctaagt
     7201 ctccctttaa ctctcttaac aaatctttta aataaaccgg tgccctgtaa ttaggtgcat
     7261 atacattgat aatcgttata ttttttctgt tgaattcatc ccttaatcat tatgtagtgc
     7321 ccctctttgt ctctcttaac agtttttgtg ctaaagttta ttttgtctga tattaagatg
     7381 gttatgcctg ctcttttttc atttctgttg gcatggacta tctttctcca gcctttcaca
     7441 tttcagtctg gatcgatctt tgttggaaag atgtgtttct gtaagcagca aatagatggg
     7501 ttttgttcct tgaacccaat cagccaatct atgtctttta actggag
//

GenBank-Updates@genbank.bio.net (05/10/91)

LOCUS       RABHBBB1B2    660 bp ds-DNA             MAM       10-MAY-1991
DEFINITION  Rabbit beta1-globin gene (allele 2) L1 repetitive sequence.
ACCESSION   K03415
KEYWORDS    L1 repetitive sequence; beta-1-globin; beta-globin; globin;
            pseudogene.
SEGMENT     2 of 2
SOURCE      Rabbit DNA [Mol. Biol. Evol. 3, 179-190 (1986)], clones L1Oc-[4,5]
            [2].
  ORGANISM  Oryctolagus cuniculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE   1  (bases 1 to 660)
  AUTHORS   Demers,G.W., Brech,K. and Hardison,R.C.
  TITLE     Long interspersed L1 repeats in rabbit DNA are homologous to L1
            repeats of rodents and primates in an open-reading-frame region
  JOURNAL   Mol. Biol. Evol. 3, 179-190 (1986)
  STANDARD  full staff_review
COMMENT     Draft entry and sequence in computer readable form for [2] kindly
            provided by R.C.Hardison, 15-JUL-1986.
               Mia 31-JUL-1986 initial entry [Mol. Biol. Evol. 3, 179-190
            (1986)]
FEATURES             Location/Qualifiers
     repeat_region   complement(<1..>659)
                     /note="L10c-5 repeat"
BASE COUNT      142 a    112 c    130 g    276 t
ORIGIN      About 2.5 kb after segment 1.
        1 tagatgttgg atgattctgg tgtttcaatt ctgttccatt ggtctatcca tctgtttctg
       61 taccagtacc atgctgtttg ataactactg ccctgtagta tgtcctgaag tctggtatgt
      121 gactgccggc tttgtttttg tgtacaagat tgctttagct attcgaggtc tcttgtgcct
      181 ccatatgaat ttcagcatca ttttttctag atcatagaag aatgtctttg gtatcttgat
      241 tggtattgca ttgaatctat aaattgcttt tgggagaatg gacattttga tgatgttgat
      301 cttccaatcc atgagcatgg aagatttttc cattttttgg tatcctcttc tatttctttc
      361 tttaaggttt tgtaattttc atcgtagaga tctttaacgt ccttggttaa gtttattcca
      421 aggtatttga ttgtttttgt agctattgtg aatgggattg atcttagcag ttctttctca
      481 gccatggcat tgcttgtgta tacaaaggct gttgattttt gtgcattgat tttatatcct
      541 gccactttgc caaactcctc tatgagttcc aatagtctct tagtagagtt ctttggatcc
      601 tctaagtaca gaatcaatat cgtctgcaaa gagggatagt ttgacttctt ccttcttgat
//