GenBank-Updates@genbank.bio.net (05/09/91)
LOCUS RABHBBB1B1 7547 bp ds-DNA MAM 09-MAY-1991 DEFINITION Rabbit beta1-globin gene (allele 2), complete cds and L1 repetitive sequences. ACCESSION K03256 M12603 KEYWORDS L1 repetitive sequence; beta-1-globin; beta-globin; globin; pseudogene. SEGMENT 1 of 2 SOURCE Rabbit DNA [Mol. Cell. Biol. 5, 147-160 (1985)], clones L1Oc-[4,5] [Mol. Biol. Evol. 3, 179-190 (1986)]. ORGANISM Oryctolagus cuniculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (bases 1 to 4161) AUTHORS Rohrbaugh,M.L., Johnson,J.E., James,M.D. and Hardison,R.C. TITLE Transcription unit of the rabbit beta1 globin gene JOURNAL Mol. Cell. Biol. 5, 147-160 (1985) STANDARD full staff_review REFERENCE 2 (bases 3067 to 5556) AUTHORS Demers,G.W., Brech,K. and Hardison,R.C. JOURNAL Unpublished (1986) Pennsylvania State U. University Park, PA 16802 STANDARD full staff_review REFERENCE 3 (bases 4258 to 7547) AUTHORS Demers,G.W., Brech,K. and Hardison,R.C. TITLE Long interspersed L1 repeats in rabbit DNA are homologous to L1 repeats of rodents and primates in an open-reading-frame region JOURNAL Mol. Biol. Evol. 3, 179-190 (1986) STANDARD full staff_review COMMENT [Unpublished (1986) Pennsylvania State U. University Park, PA 16802] revises [Mol. Biol. Evol. 3, 179-190 (1986)]. Draft entry and sequence in computer readable form for [Unpublished (1986) Pennsylvania State U. University Park, PA 16802] kindly provided by M.Rohrbaugh, 16-AUG-1985. Draft entry and sequence in computer readable form for [Mol. Biol. Evol. 3, 179-190 (1986)] kindly provided by R.C.Hardison, 15-JUL-1986. A 'CAAT' box is present at position 350-358, a 'TATA' box is found at position 396-401. A polyadenylation signal is located at position 1690-1695. The putative ancestral reading frame of the L10c repeats ended with the stop codon at position 4479-4477 (on comp strand) and continued beyond the end of this repeat element [Mol. Biol. Evol. 3, 179-190 (1986)]. The RNA transcribed from the strand opposite the B1 gene (FL mRNA in sites) has been shown to be transcribed in fetal liver nuclei. The 3' end of this transcript has not yet been determined. An 'atg' codon is present at position 2168-2170 (on comp strand). FEATURES Location/Qualifiers mRNA complement(<2245..2245) /note="FL mRNA" CDS join(480..571,698..920,1494..1622) /note="beta-1 globin" /codon_start=480 mRNA 427..1714 /note="b1-g mRNA" exon 480..571 /note="beta-1 globin, exon 1" intron 572..697 /note="b1-g intron A" exon 698..920 /note="beta-1 globin, exon 2" intron 921..1493 /note="b1-g intron B" exon 1494..1622 /note="beta-1 globin, exon 3" repeat_region complement(3234..7547) /note="L10c copy 4 [Mol. Biol. Evol. 3, 179-190 (1986)], [Unpublished (1986) Pennsylvania State U. University Park, PA 16802]" misc_feature 5556..5557 /note="deletion in L10c copy 4" BASE COUNT 1861 a 1488 c 1513 g 2685 t ORIGIN 1 bp upstream of BglII site. 1 agatctctct ctctctctct ctctctctct ctacctatct atttatctat ttaagtggat 61 ttcaacacac aaatcttctc ccttttctgt gccttaaatc ctcatttgta tgataaataa 121 ttgcagagaa aatttttcat aggcttacca ggctctaata acaaaaatta tataaataaa 181 tttggcaaga aaggtgtttt cagtagcaat tagtactgct ggtatgggtc tgggagatac 241 atagaaggaa ggctgagtct gtcagactcc taagccattg ccataactgc caaggacagg 301 ggtgctgtca tcacccagac ctcaccctgc agagccacac cctggtgttg gccaatctac 361 acacggggta gggattacat agttcaggac ttgggcataa aaggcagagc agggcagctg 421 ctgcttacac ttgcttttga cacaactgtg tttacttgca atcccccaaa acagacagaa 481 tggtgcatct gtccagtgag gagaagtctg cggtcactgc cctgtggggc aaggtgaatg 541 tggaagaagt tggtggtgag gccctgggca ggttggtatc ctttttacag cacaacttaa 601 tgagacagat agaaactggt cttgtagaaa cagagtagtc gcctgctttt ctgccaggtg 661 ctgacttctc tcccctgggc tgttttcatt ttctcaggct gctggttgtc tacccatgga 721 cccagaggtt cttcgagtcc tttggggacc tgtcctctgc acatgctgtt atgagcaatc 781 ctaaggtgaa ggctcatggc aagaaggtgc tggctgcctt cagtgagggt ctgaatcacc 841 tggacaacct caaaggcacc tttgctaagc tgagtgaact gcactgtgac aagctgcacg 901 tggatcctga gaacttcagg gtgagtttgg ggacccttga ttgttctttc tttttcgcta 961 ttgtaaaatt catgttatat ggagggggca aagttttcag ggtgttgttt agaatgggaa 1021 gatgtccctt gtatcaccat ggaccctcat gataattttg tttctttcac tttctactct 1081 gttgacaacc attgtctcct cttattttct tttcattttc tgtaactttt tcgttaaact 1141 ttagcttgca tttgtaacga atttttaaat tcacttttgt ttatttgtca gattgtaagt 1201 actttctcta atcacttttt tttcaaggca atcagggtat attatattgt acttcagcac 1261 agttttagag aacaattgtt ataattaaat gataaggtag aatatttctg catataaatt 1321 ctggctggcg tggaaatatt cttattggta gaaacaacta catcctggtc atcatcctgc 1381 ctttctcttt atggttacaa tgatatacac tgtttgagat gaggataaaa tactctgagt 1441 ccaaaccggg cccctctgct aaccatgttc atgccttctt ctttttccta cagctcctgg 1501 gcaacgtgct ggttgttgtg ctgtctcatc attttggcaa agaattcact cctcaggtgc 1561 aggctgccta tcagaaggtg gtggctggtg tggccaatgc cctggctcac aaataccact 1621 gagatctttt tccctctgcc aaaaattatg gggacatcat gaagcccctt gagcatctga 1681 cttctggcta ataaaggaaa tttattttca ttgcaatagt gtgttggaat tttttgtgtc 1741 tctcactcgg aaggacatat gggagggcaa atcatttaaa acatcagaat gagtatttgg 1801 tttagagttt ggcaacatat gccatatgct ggctgccatg aacaaaggtg gctataaaga 1861 ggtcatcagt atatgaaaca gccccctgct gtccattcct tattccatag aaaagccttg 1921 acttgaggtt agattttttt tatattttgt tttgtgttat ttttttcttt aacatcccta 1981 aaattttcct tacatgtttt actagccaga tttttcctcc tctcctgact actcccagtc 2041 atagctgtcc ctcttctctt atgaagatct tattaaagca gctgggacag ggacagaaaa 2101 agggctttga ctgcctttct cttgagccct tttcctgatc tccacaactc actgatacca 2161 ctggtctcat tggaaggggt gggctgttaa cagtgtgaca aatgtaggaa taaactggat 2221 gcaaaagggg gctttgtgca gctttatatt cactgttgtc ttaaaccctt tttatggact 2281 caaatcaaat gacagtccct caggatgtta gcttctgaat tcagaaagtg attgcagagt 2341 tgcccactcc tttatcctgt gtctgatggt tttgctgtct ctgtagtgat tagcttatgt 2401 caccatttcc tcattcaata ggcactaggt ggatgaaagg ttctggttca ctccccaaat 2461 acctgcaaca gtcaggagtg tgtcaggcca aaaccagaaa acaggaattg ccatggggtc 2521 tccatgatgg gtggcaggga ctcaagtaca tgagccatat tcggctgctt ccaggtacat 2581 tagcagaaaa ctagatcaga agtggagctg tggggaccag aataaacact ttgatatggg 2641 atgttggtgt ctcaagtagc aacttaaccc cctgctcact aaaacactct aatcctcatt 2701 acctaggagc aactgagcct gagggctatc taatatagct ggtgacacag agatcatata 2761 ccctggctaa aagcatggct gaatccatga aagaaaatat atgctcaaaa taggaataga 2821 atacacagat ttatgcacag atgcttacaa attttagcca atcctgatga catggttaac 2881 ttggagatct agatcagttc ttgccagcat gcccagagaa tagtacatgg gaaaatttat 2941 agagatgatg agttagagac aaagtgagtg ataatgacat tgcctgggat tgctgctagg 3001 tacactgaaa aatcagggag gaagatccaa taaatgaccc attcaaaatc tagaaaacct 3061 gtcaacagga actttggaaa cttatttcta atgtatctga acatcaaggc agcaataagt 3121 ctttctgtaa aatcattaaa tatgcccaaa tgtcaagttc tatgtgagtc atgaaggtaa 3181 cttgataatg ctctacactt catattttgt tcattgttta atacaaaacg caatttttat 3241 tttatttatt taatttttaa ctgtttattt aataaatata aatttccaaa ttacagctta 3301 tagattacaa tggcttcatc ctcataactt gccttgccaa cctgcaaccc tcccatctcc 3361 tgctccctct cccattccat tcacatcaag attcattttc aattatcttt atatacagaa 3421 gatcaattta gtatatatta agtaaagatt ttaacagttt gcacccacac agaacataaa 3481 gtataaatac tgtttgagta ctagttatag cattaattca cattgaacaa cacattaagg 3541 acagagatcc tacatgagga gtaagtgcac agcgactcct gtcgttgact taacaaattg 3601 acattcttgt ttagggggtc agttatctcc ccaggctcct gtcatgagtt accaaggcta 3661 tggaggcctt ttgagttcac tgacttcgat cttatttaga caaggtcata gtgaaagtgg 3721 aagtccactc ctccctttag agaacggtac ctccttcctc aatggcccat tctttcaact 3781 gggatctcgc tcacagagat ctttcattta gctcatttaa ctcctttttt tttttttttt 3841 tctagagcat cttacctttc cattgcctga aatactttca tgggctcttc agccagatgt 3901 gaatgcctta agggctgatt ctgaggccag agtgctgttt aggacatgtg ccattctatg 3961 agtctgatgt gtatcccatt tcccatgttg gaatgttctc tccattttta attctgtcag 4021 ttagtattag cagacactag tcttgtttat gtgatccctc tgactcttat gcctatcatt 4081 acgatcaatt gtgaacagaa attgatcact gggactagtg agatggcatt ggaacatggc 4141 cacctcaatg ggattgaatt cgtaatcccc tggtctgttt ctaactctac catttgaggt 4201 aagtcagttt gagcatgtcc cgaattgcac atctcttccc tctcttattc ccactcttat 4261 atttaacagg gattactttt cagttaaatt taaacaccta agaataattg tgtgttaatt 4321 acagagttca accaatagta ttaagtagaa caaccaaaaa atactaaaag ggataaagta 4381 ttacattgta catcaacagt caggacaagg gctgttcaag tcactgtttc tcatagtgtt 4441 catttcactt tgacaggttt cctttttggt gctgggtcag ttgtcactga tcagggagaa 4501 catatgatat ttgtcccttt gggactggct tatttcactc agcatgatgt gttccagatt 4561 cctccatttt gttgcaaatg accggatttc attgtttttt tttgcttcta tatagtattc 4621 tatagagtac atgtcccata atttcttcct ccagtctact gttgatgggc atttgggttg 4681 gttccaggtc ttagctattg tgaagtgagc cgcaataaac attgaggtgc agacagcttg 4741 tttgtttgcc aatttaattt cctttgggta aattccagga gcgggatggc tgggttgtat 4801 ggtagggtta tattcaggtt tctgaggatc tccagactga cttccatagg ggcttaacca 4861 gtttgcattc ccaccaacag tgggttagtg tccctttctc cccacatcct ttccagcatc 4921 tattgttggt agatttctgt atgtgagcca ttctaagcgg ggtgaggtga aacctcattg 4981 tggttttgat ttgcatttcc ctgattgcta gcgatcttga acatttcttc atgtggatgt 5041 tggccatttg gatttcctct tttcaaaaat ggcaagtgag gtccttggcc catctcttaa 5101 gtgggttgtt tgttttgatg ctgtggagtt tctttatgtc tttgtggatt ctagctatta 5161 atgctttatc tgttgcttag tttgcaaata ttttttccca ttctgtcagt tgcctcttca 5221 cttcctgact gcttcttttg cagtacagaa cttctcaatt tgatgtaatc tcaatagtta 5281 attttggctt tgactgcctg tgcctccagg gtcttttcca agaagtcttt gcggtgccaa 5341 tatcttgcag ggtttctcca atgttctcta ataacttcat ggtgtcgggt catagattta 5401 ggtctttaat ccatgttgag tggatttttg tgtaaggtgt aaggtagggg tctttcttca 5461 tgcttcagca cgtggaaatc ccagcaccat ttattgaata gactgtcctt gctccaggaa 5521 ttggttttag attcctgatc aaatataagt aggctgttgt atcccttcaa tttctttttc 5581 ttgcctaaca gctctggcta aagcctccag aaatatactg aatagcagtg gtgagaatgg 5641 atatccctgt atggtaccag atctcagtgg aaatgcttcc aactttttcc cattcaatag 5701 gatgctggtc gtgggttttt cataaattgc tttgattgta ttgaggaaca ttccttctat 5761 acccagttta cttagagttt tcaccatgaa agggtcttgt gttttattga atgctttctc 5821 tgcatctatt gagataatca tatggttttt cttctgcagt ctgttaatgt ggtgtatcac 5881 atttgcaaac acttgaacca tccctgcata ccagggttat atcccacttg gtctgggtga 5941 atgatctttc tgaaatgttg ttgcactccg ttggccagaa ttttattgag aatttttgag 6001 tctatgttca ttaggtatat tgttctgtaa ttttctttca atgctgcatc tttttccggc 6061 ttaggaatta aggtgatgct ggattcatag aaagattttg ggaggattcc ctctttttca 6121 attgttctga atagtttgag aagaattgag ttagttcttc tttaaatttc tggtagaatt 6181 cagtagtgaa tccatctggt cctgggcttt tctttgttgg gagggccttt attactgttt 6241 caatttctgc ctcagttatg ggtttgttta ggctttcgat gtcttcctgg ttcaatgtag 6301 gtaggttgca ggtgtccagg aatctatgca tttctgatag atttccctgt ttgctggcat 6361 acagtccttg tagtaatttc tgatgattct tttcatttct gtggtgtctg ctgttacatt 6421 tcctatttca tctctgattt tattgatttg gtctcttctt cttttagtta gttgagctaa 6481 tgcggtatca attttgttta ttttttcaaa aaaccagctc cccatttggc tgatttttgg 6541 taattttttt ggattcaatc ctgttgattt cttctctgat tttaattatt tctcttctcc 6601 tactagattt gggtctgctt tgctgcagtt tttctagatc cttgaggtga tttgaaagct 6661 catctatttg gtgcctttcc aatttcttga tgtaggcacc tattgatata aacttttctc 6721 ttaacactgc tttcgctgca tctcatacat tttggtatgt tgtgctgtta tcctcattta 6781 cttccagaaa gtttttgatt tctcttttga tttctttgat gacctagtgt tcattcagga 6841 gcatgttgtt cactctccat gtgtttgcat atgctgtagg gattcctgag ttgctaattt 6901 ccgacttcat tctattatgg tctgagaagc tgcatcatat gattctaatt cttttgaatg 6961 tgctgagact tgctttatgg cctagtatgt ggttaatctt agagtaggtt ccatgtactg 7021 ctgagaagaa tgtaaattct ttaagtgcag gatgaaaagt tctgtagata tgtgtcagat 7081 ccatctgggc tatagtatcc tttgaatgta ctgtttcctg tagtcttctg tcctgtgatc 7141 tgtctatttc tgagagtgga gtattgaagt cccccagtac tattgtattg gagtctaagt 7201 ctccctttaa ctctcttaac aaatctttta aataaaccgg tgccctgtaa ttaggtgcat 7261 atacattgat aatcgttata ttttttctgt tgaattcatc ccttaatcat tatgtagtgc 7321 ccctctttgt ctctcttaac agtttttgtg ctaaagttta ttttgtctga tattaagatg 7381 gttatgcctg ctcttttttc atttctgttg gcatggacta tctttctcca gcctttcaca 7441 tttcagtctg gatcgatctt tgttggaaag atgtgtttct gtaagcagca aatagatggg 7501 ttttgttcct tgaacccaat cagccaatct atgtctttta actggag //
GenBank-Updates@genbank.bio.net (05/10/91)
LOCUS RABHBBB1B2 660 bp ds-DNA MAM 10-MAY-1991 DEFINITION Rabbit beta1-globin gene (allele 2) L1 repetitive sequence. ACCESSION K03415 KEYWORDS L1 repetitive sequence; beta-1-globin; beta-globin; globin; pseudogene. SEGMENT 2 of 2 SOURCE Rabbit DNA [Mol. Biol. Evol. 3, 179-190 (1986)], clones L1Oc-[4,5] [2]. ORGANISM Oryctolagus cuniculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (bases 1 to 660) AUTHORS Demers,G.W., Brech,K. and Hardison,R.C. TITLE Long interspersed L1 repeats in rabbit DNA are homologous to L1 repeats of rodents and primates in an open-reading-frame region JOURNAL Mol. Biol. Evol. 3, 179-190 (1986) STANDARD full staff_review COMMENT Draft entry and sequence in computer readable form for [2] kindly provided by R.C.Hardison, 15-JUL-1986. Mia 31-JUL-1986 initial entry [Mol. Biol. Evol. 3, 179-190 (1986)] FEATURES Location/Qualifiers repeat_region complement(<1..>659) /note="L10c-5 repeat" BASE COUNT 142 a 112 c 130 g 276 t ORIGIN About 2.5 kb after segment 1. 1 tagatgttgg atgattctgg tgtttcaatt ctgttccatt ggtctatcca tctgtttctg 61 taccagtacc atgctgtttg ataactactg ccctgtagta tgtcctgaag tctggtatgt 121 gactgccggc tttgtttttg tgtacaagat tgctttagct attcgaggtc tcttgtgcct 181 ccatatgaat ttcagcatca ttttttctag atcatagaag aatgtctttg gtatcttgat 241 tggtattgca ttgaatctat aaattgcttt tgggagaatg gacattttga tgatgttgat 301 cttccaatcc atgagcatgg aagatttttc cattttttgg tatcctcttc tatttctttc 361 tttaaggttt tgtaattttc atcgtagaga tctttaacgt ccttggttaa gtttattcca 421 aggtatttga ttgtttttgt agctattgtg aatgggattg atcttagcag ttctttctca 481 gccatggcat tgcttgtgta tacaaaggct gttgattttt gtgcattgat tttatatcct 541 gccactttgc caaactcctc tatgagttcc aatagtctct tagtagagtt ctttggatcc 601 tctaagtaca gaatcaatat cgtctgcaaa gagggatagt ttgacttctt ccttcttgat //