GenBank-Updates@genbank.bio.net (05/09/91)
LOCUS RABHBBB1B1 7547 bp ds-DNA MAM 09-MAY-1991
DEFINITION Rabbit beta1-globin gene (allele 2), complete cds and L1 repetitive
sequences.
ACCESSION K03256 M12603
KEYWORDS L1 repetitive sequence; beta-1-globin; beta-globin; globin;
pseudogene.
SEGMENT 1 of 2
SOURCE Rabbit DNA [Mol. Cell. Biol. 5, 147-160 (1985)], clones L1Oc-[4,5]
[Mol. Biol. Evol. 3, 179-190 (1986)].
ORGANISM Oryctolagus cuniculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE 1 (bases 1 to 4161)
AUTHORS Rohrbaugh,M.L., Johnson,J.E., James,M.D. and Hardison,R.C.
TITLE Transcription unit of the rabbit beta1 globin gene
JOURNAL Mol. Cell. Biol. 5, 147-160 (1985)
STANDARD full staff_review
REFERENCE 2 (bases 3067 to 5556)
AUTHORS Demers,G.W., Brech,K. and Hardison,R.C.
JOURNAL Unpublished (1986) Pennsylvania State U. University Park, PA 16802
STANDARD full staff_review
REFERENCE 3 (bases 4258 to 7547)
AUTHORS Demers,G.W., Brech,K. and Hardison,R.C.
TITLE Long interspersed L1 repeats in rabbit DNA are homologous to L1
repeats of rodents and primates in an open-reading-frame region
JOURNAL Mol. Biol. Evol. 3, 179-190 (1986)
STANDARD full staff_review
COMMENT
[Unpublished (1986) Pennsylvania State U. University Park, PA
16802] revises [Mol. Biol. Evol. 3, 179-190 (1986)].
Draft entry and sequence in computer readable form for [Unpublished
(1986) Pennsylvania State U. University Park, PA 16802] kindly
provided by M.Rohrbaugh, 16-AUG-1985. Draft entry and sequence in
computer readable form for [Mol. Biol. Evol. 3, 179-190 (1986)]
kindly provided by R.C.Hardison,
15-JUL-1986.
A 'CAAT' box is present at position 350-358, a 'TATA' box is found
at position 396-401. A polyadenylation signal is located at
position 1690-1695.
The putative ancestral reading frame of the L10c repeats ended with
the stop codon at position 4479-4477 (on comp strand) and continued
beyond the end of this repeat element [Mol. Biol. Evol. 3, 179-190
(1986)]. The RNA transcribed
from the strand opposite the B1 gene (FL mRNA in sites) has been
shown to be transcribed in fetal liver nuclei. The 3' end of this
transcript has not yet been determined. An 'atg' codon is present
at position 2168-2170 (on comp strand).
FEATURES Location/Qualifiers
mRNA complement(<2245..2245)
/note="FL mRNA"
CDS join(480..571,698..920,1494..1622)
/note="beta-1 globin"
/codon_start=480
mRNA 427..1714
/note="b1-g mRNA"
exon 480..571
/note="beta-1 globin, exon 1"
intron 572..697
/note="b1-g intron A"
exon 698..920
/note="beta-1 globin, exon 2"
intron 921..1493
/note="b1-g intron B"
exon 1494..1622
/note="beta-1 globin, exon 3"
repeat_region complement(3234..7547)
/note="L10c copy 4 [Mol. Biol. Evol. 3, 179-190 (1986)],
[Unpublished (1986) Pennsylvania State U. University Park,
PA 16802]"
misc_feature 5556..5557
/note="deletion in L10c copy 4"
BASE COUNT 1861 a 1488 c 1513 g 2685 t
ORIGIN 1 bp upstream of BglII site.
1 agatctctct ctctctctct ctctctctct ctacctatct atttatctat ttaagtggat
61 ttcaacacac aaatcttctc ccttttctgt gccttaaatc ctcatttgta tgataaataa
121 ttgcagagaa aatttttcat aggcttacca ggctctaata acaaaaatta tataaataaa
181 tttggcaaga aaggtgtttt cagtagcaat tagtactgct ggtatgggtc tgggagatac
241 atagaaggaa ggctgagtct gtcagactcc taagccattg ccataactgc caaggacagg
301 ggtgctgtca tcacccagac ctcaccctgc agagccacac cctggtgttg gccaatctac
361 acacggggta gggattacat agttcaggac ttgggcataa aaggcagagc agggcagctg
421 ctgcttacac ttgcttttga cacaactgtg tttacttgca atcccccaaa acagacagaa
481 tggtgcatct gtccagtgag gagaagtctg cggtcactgc cctgtggggc aaggtgaatg
541 tggaagaagt tggtggtgag gccctgggca ggttggtatc ctttttacag cacaacttaa
601 tgagacagat agaaactggt cttgtagaaa cagagtagtc gcctgctttt ctgccaggtg
661 ctgacttctc tcccctgggc tgttttcatt ttctcaggct gctggttgtc tacccatgga
721 cccagaggtt cttcgagtcc tttggggacc tgtcctctgc acatgctgtt atgagcaatc
781 ctaaggtgaa ggctcatggc aagaaggtgc tggctgcctt cagtgagggt ctgaatcacc
841 tggacaacct caaaggcacc tttgctaagc tgagtgaact gcactgtgac aagctgcacg
901 tggatcctga gaacttcagg gtgagtttgg ggacccttga ttgttctttc tttttcgcta
961 ttgtaaaatt catgttatat ggagggggca aagttttcag ggtgttgttt agaatgggaa
1021 gatgtccctt gtatcaccat ggaccctcat gataattttg tttctttcac tttctactct
1081 gttgacaacc attgtctcct cttattttct tttcattttc tgtaactttt tcgttaaact
1141 ttagcttgca tttgtaacga atttttaaat tcacttttgt ttatttgtca gattgtaagt
1201 actttctcta atcacttttt tttcaaggca atcagggtat attatattgt acttcagcac
1261 agttttagag aacaattgtt ataattaaat gataaggtag aatatttctg catataaatt
1321 ctggctggcg tggaaatatt cttattggta gaaacaacta catcctggtc atcatcctgc
1381 ctttctcttt atggttacaa tgatatacac tgtttgagat gaggataaaa tactctgagt
1441 ccaaaccggg cccctctgct aaccatgttc atgccttctt ctttttccta cagctcctgg
1501 gcaacgtgct ggttgttgtg ctgtctcatc attttggcaa agaattcact cctcaggtgc
1561 aggctgccta tcagaaggtg gtggctggtg tggccaatgc cctggctcac aaataccact
1621 gagatctttt tccctctgcc aaaaattatg gggacatcat gaagcccctt gagcatctga
1681 cttctggcta ataaaggaaa tttattttca ttgcaatagt gtgttggaat tttttgtgtc
1741 tctcactcgg aaggacatat gggagggcaa atcatttaaa acatcagaat gagtatttgg
1801 tttagagttt ggcaacatat gccatatgct ggctgccatg aacaaaggtg gctataaaga
1861 ggtcatcagt atatgaaaca gccccctgct gtccattcct tattccatag aaaagccttg
1921 acttgaggtt agattttttt tatattttgt tttgtgttat ttttttcttt aacatcccta
1981 aaattttcct tacatgtttt actagccaga tttttcctcc tctcctgact actcccagtc
2041 atagctgtcc ctcttctctt atgaagatct tattaaagca gctgggacag ggacagaaaa
2101 agggctttga ctgcctttct cttgagccct tttcctgatc tccacaactc actgatacca
2161 ctggtctcat tggaaggggt gggctgttaa cagtgtgaca aatgtaggaa taaactggat
2221 gcaaaagggg gctttgtgca gctttatatt cactgttgtc ttaaaccctt tttatggact
2281 caaatcaaat gacagtccct caggatgtta gcttctgaat tcagaaagtg attgcagagt
2341 tgcccactcc tttatcctgt gtctgatggt tttgctgtct ctgtagtgat tagcttatgt
2401 caccatttcc tcattcaata ggcactaggt ggatgaaagg ttctggttca ctccccaaat
2461 acctgcaaca gtcaggagtg tgtcaggcca aaaccagaaa acaggaattg ccatggggtc
2521 tccatgatgg gtggcaggga ctcaagtaca tgagccatat tcggctgctt ccaggtacat
2581 tagcagaaaa ctagatcaga agtggagctg tggggaccag aataaacact ttgatatggg
2641 atgttggtgt ctcaagtagc aacttaaccc cctgctcact aaaacactct aatcctcatt
2701 acctaggagc aactgagcct gagggctatc taatatagct ggtgacacag agatcatata
2761 ccctggctaa aagcatggct gaatccatga aagaaaatat atgctcaaaa taggaataga
2821 atacacagat ttatgcacag atgcttacaa attttagcca atcctgatga catggttaac
2881 ttggagatct agatcagttc ttgccagcat gcccagagaa tagtacatgg gaaaatttat
2941 agagatgatg agttagagac aaagtgagtg ataatgacat tgcctgggat tgctgctagg
3001 tacactgaaa aatcagggag gaagatccaa taaatgaccc attcaaaatc tagaaaacct
3061 gtcaacagga actttggaaa cttatttcta atgtatctga acatcaaggc agcaataagt
3121 ctttctgtaa aatcattaaa tatgcccaaa tgtcaagttc tatgtgagtc atgaaggtaa
3181 cttgataatg ctctacactt catattttgt tcattgttta atacaaaacg caatttttat
3241 tttatttatt taatttttaa ctgtttattt aataaatata aatttccaaa ttacagctta
3301 tagattacaa tggcttcatc ctcataactt gccttgccaa cctgcaaccc tcccatctcc
3361 tgctccctct cccattccat tcacatcaag attcattttc aattatcttt atatacagaa
3421 gatcaattta gtatatatta agtaaagatt ttaacagttt gcacccacac agaacataaa
3481 gtataaatac tgtttgagta ctagttatag cattaattca cattgaacaa cacattaagg
3541 acagagatcc tacatgagga gtaagtgcac agcgactcct gtcgttgact taacaaattg
3601 acattcttgt ttagggggtc agttatctcc ccaggctcct gtcatgagtt accaaggcta
3661 tggaggcctt ttgagttcac tgacttcgat cttatttaga caaggtcata gtgaaagtgg
3721 aagtccactc ctccctttag agaacggtac ctccttcctc aatggcccat tctttcaact
3781 gggatctcgc tcacagagat ctttcattta gctcatttaa ctcctttttt tttttttttt
3841 tctagagcat cttacctttc cattgcctga aatactttca tgggctcttc agccagatgt
3901 gaatgcctta agggctgatt ctgaggccag agtgctgttt aggacatgtg ccattctatg
3961 agtctgatgt gtatcccatt tcccatgttg gaatgttctc tccattttta attctgtcag
4021 ttagtattag cagacactag tcttgtttat gtgatccctc tgactcttat gcctatcatt
4081 acgatcaatt gtgaacagaa attgatcact gggactagtg agatggcatt ggaacatggc
4141 cacctcaatg ggattgaatt cgtaatcccc tggtctgttt ctaactctac catttgaggt
4201 aagtcagttt gagcatgtcc cgaattgcac atctcttccc tctcttattc ccactcttat
4261 atttaacagg gattactttt cagttaaatt taaacaccta agaataattg tgtgttaatt
4321 acagagttca accaatagta ttaagtagaa caaccaaaaa atactaaaag ggataaagta
4381 ttacattgta catcaacagt caggacaagg gctgttcaag tcactgtttc tcatagtgtt
4441 catttcactt tgacaggttt cctttttggt gctgggtcag ttgtcactga tcagggagaa
4501 catatgatat ttgtcccttt gggactggct tatttcactc agcatgatgt gttccagatt
4561 cctccatttt gttgcaaatg accggatttc attgtttttt tttgcttcta tatagtattc
4621 tatagagtac atgtcccata atttcttcct ccagtctact gttgatgggc atttgggttg
4681 gttccaggtc ttagctattg tgaagtgagc cgcaataaac attgaggtgc agacagcttg
4741 tttgtttgcc aatttaattt cctttgggta aattccagga gcgggatggc tgggttgtat
4801 ggtagggtta tattcaggtt tctgaggatc tccagactga cttccatagg ggcttaacca
4861 gtttgcattc ccaccaacag tgggttagtg tccctttctc cccacatcct ttccagcatc
4921 tattgttggt agatttctgt atgtgagcca ttctaagcgg ggtgaggtga aacctcattg
4981 tggttttgat ttgcatttcc ctgattgcta gcgatcttga acatttcttc atgtggatgt
5041 tggccatttg gatttcctct tttcaaaaat ggcaagtgag gtccttggcc catctcttaa
5101 gtgggttgtt tgttttgatg ctgtggagtt tctttatgtc tttgtggatt ctagctatta
5161 atgctttatc tgttgcttag tttgcaaata ttttttccca ttctgtcagt tgcctcttca
5221 cttcctgact gcttcttttg cagtacagaa cttctcaatt tgatgtaatc tcaatagtta
5281 attttggctt tgactgcctg tgcctccagg gtcttttcca agaagtcttt gcggtgccaa
5341 tatcttgcag ggtttctcca atgttctcta ataacttcat ggtgtcgggt catagattta
5401 ggtctttaat ccatgttgag tggatttttg tgtaaggtgt aaggtagggg tctttcttca
5461 tgcttcagca cgtggaaatc ccagcaccat ttattgaata gactgtcctt gctccaggaa
5521 ttggttttag attcctgatc aaatataagt aggctgttgt atcccttcaa tttctttttc
5581 ttgcctaaca gctctggcta aagcctccag aaatatactg aatagcagtg gtgagaatgg
5641 atatccctgt atggtaccag atctcagtgg aaatgcttcc aactttttcc cattcaatag
5701 gatgctggtc gtgggttttt cataaattgc tttgattgta ttgaggaaca ttccttctat
5761 acccagttta cttagagttt tcaccatgaa agggtcttgt gttttattga atgctttctc
5821 tgcatctatt gagataatca tatggttttt cttctgcagt ctgttaatgt ggtgtatcac
5881 atttgcaaac acttgaacca tccctgcata ccagggttat atcccacttg gtctgggtga
5941 atgatctttc tgaaatgttg ttgcactccg ttggccagaa ttttattgag aatttttgag
6001 tctatgttca ttaggtatat tgttctgtaa ttttctttca atgctgcatc tttttccggc
6061 ttaggaatta aggtgatgct ggattcatag aaagattttg ggaggattcc ctctttttca
6121 attgttctga atagtttgag aagaattgag ttagttcttc tttaaatttc tggtagaatt
6181 cagtagtgaa tccatctggt cctgggcttt tctttgttgg gagggccttt attactgttt
6241 caatttctgc ctcagttatg ggtttgttta ggctttcgat gtcttcctgg ttcaatgtag
6301 gtaggttgca ggtgtccagg aatctatgca tttctgatag atttccctgt ttgctggcat
6361 acagtccttg tagtaatttc tgatgattct tttcatttct gtggtgtctg ctgttacatt
6421 tcctatttca tctctgattt tattgatttg gtctcttctt cttttagtta gttgagctaa
6481 tgcggtatca attttgttta ttttttcaaa aaaccagctc cccatttggc tgatttttgg
6541 taattttttt ggattcaatc ctgttgattt cttctctgat tttaattatt tctcttctcc
6601 tactagattt gggtctgctt tgctgcagtt tttctagatc cttgaggtga tttgaaagct
6661 catctatttg gtgcctttcc aatttcttga tgtaggcacc tattgatata aacttttctc
6721 ttaacactgc tttcgctgca tctcatacat tttggtatgt tgtgctgtta tcctcattta
6781 cttccagaaa gtttttgatt tctcttttga tttctttgat gacctagtgt tcattcagga
6841 gcatgttgtt cactctccat gtgtttgcat atgctgtagg gattcctgag ttgctaattt
6901 ccgacttcat tctattatgg tctgagaagc tgcatcatat gattctaatt cttttgaatg
6961 tgctgagact tgctttatgg cctagtatgt ggttaatctt agagtaggtt ccatgtactg
7021 ctgagaagaa tgtaaattct ttaagtgcag gatgaaaagt tctgtagata tgtgtcagat
7081 ccatctgggc tatagtatcc tttgaatgta ctgtttcctg tagtcttctg tcctgtgatc
7141 tgtctatttc tgagagtgga gtattgaagt cccccagtac tattgtattg gagtctaagt
7201 ctccctttaa ctctcttaac aaatctttta aataaaccgg tgccctgtaa ttaggtgcat
7261 atacattgat aatcgttata ttttttctgt tgaattcatc ccttaatcat tatgtagtgc
7321 ccctctttgt ctctcttaac agtttttgtg ctaaagttta ttttgtctga tattaagatg
7381 gttatgcctg ctcttttttc atttctgttg gcatggacta tctttctcca gcctttcaca
7441 tttcagtctg gatcgatctt tgttggaaag atgtgtttct gtaagcagca aatagatggg
7501 ttttgttcct tgaacccaat cagccaatct atgtctttta actggag
//GenBank-Updates@genbank.bio.net (05/10/91)
LOCUS RABHBBB1B2 660 bp ds-DNA MAM 10-MAY-1991
DEFINITION Rabbit beta1-globin gene (allele 2) L1 repetitive sequence.
ACCESSION K03415
KEYWORDS L1 repetitive sequence; beta-1-globin; beta-globin; globin;
pseudogene.
SEGMENT 2 of 2
SOURCE Rabbit DNA [Mol. Biol. Evol. 3, 179-190 (1986)], clones L1Oc-[4,5]
[2].
ORGANISM Oryctolagus cuniculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE 1 (bases 1 to 660)
AUTHORS Demers,G.W., Brech,K. and Hardison,R.C.
TITLE Long interspersed L1 repeats in rabbit DNA are homologous to L1
repeats of rodents and primates in an open-reading-frame region
JOURNAL Mol. Biol. Evol. 3, 179-190 (1986)
STANDARD full staff_review
COMMENT Draft entry and sequence in computer readable form for [2] kindly
provided by R.C.Hardison, 15-JUL-1986.
Mia 31-JUL-1986 initial entry [Mol. Biol. Evol. 3, 179-190
(1986)]
FEATURES Location/Qualifiers
repeat_region complement(<1..>659)
/note="L10c-5 repeat"
BASE COUNT 142 a 112 c 130 g 276 t
ORIGIN About 2.5 kb after segment 1.
1 tagatgttgg atgattctgg tgtttcaatt ctgttccatt ggtctatcca tctgtttctg
61 taccagtacc atgctgtttg ataactactg ccctgtagta tgtcctgaag tctggtatgt
121 gactgccggc tttgtttttg tgtacaagat tgctttagct attcgaggtc tcttgtgcct
181 ccatatgaat ttcagcatca ttttttctag atcatagaag aatgtctttg gtatcttgat
241 tggtattgca ttgaatctat aaattgcttt tgggagaatg gacattttga tgatgttgat
301 cttccaatcc atgagcatgg aagatttttc cattttttgg tatcctcttc tatttctttc
361 tttaaggttt tgtaattttc atcgtagaga tctttaacgt ccttggttaa gtttattcca
421 aggtatttga ttgtttttgt agctattgtg aatgggattg atcttagcag ttctttctca
481 gccatggcat tgcttgtgta tacaaaggct gttgattttt gtgcattgat tttatatcct
541 gccactttgc caaactcctc tatgagttcc aatagtctct tagtagagtt ctttggatcc
601 tctaagtaca gaatcaatat cgtctgcaaa gagggatagt ttgacttctt ccttcttgat
//