GenBank-Updates@genbank.bio.net (05/22/91)
LOCUS RABATGL1 4028 bp ds-DNA MAM 22-MAY-1991
DEFINITION Rabbit alpha-1-globin gene to theta-1-globin pseudogene region
ACCESSION X04751
KEYWORDS alpha-1-globin; alpha-globin; globin; pseudogene;
repetitive sequence; tandem repeat; theta-1-globin; theta-globin.
SOURCE Oryctolagus cuniculus DNA.
ORGANISM Oryctolagus cuniculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE 1 (bases 1 to 4028)
AUTHORS Cheng,J.-F., Raid,L. and Hardison,R.C.
TITLE Isolation and nucleotide sequence of the rabbit globin gene cluster
psi-zeta-alpha-1-psi-alpha: Absence of a pair of alpha-globin genes
evolving in concert
JOURNAL J. Biol. Chem. 261, 839-848 (1986)
STANDARD full automatic
REFERENCE 2 (bases 1 to 4028)
AUTHORS Hardison,R.C.
JOURNAL Unpublished (1987)
STANDARD full automatic
COMMENT SWISS-PROT; P01948; HBA$RABIT.
Submitted data [2] include some corrections to published seq. [1].
Referring to the authors the sequence from pos. 50 to 70 may not be
completely accurate due to reading problems of the sequencing gels.
Theta-1 pseudogene was formerly called psi alpha. Data kindly
reviewed (15-Jun-1987) by Hardison R.C.
From EMBL 26 entry OCATGL1; dated 22-APR-1990.
FEATURES Location/Qualifiers
precursor_RNA 150..861
/note="primary transcript od alpha-1-globin"
mRNA 150..280
/note="exon 1"
CDS 186..280
/note="alpha-1-globin (AA 1-32) (280 is 2nd base in
codon)"
/codon_start=186
intron 281..357
/note="intron I"
mRNA 358..562
/note="exon 2"
CDS 358..562
/note="alpha-1-globin (AA 33-100) (358 is 3rd base in
codon)"
/codon_start=358
intron 563..645
/note="intron II"
mRNA 646..861
/note="exon 3"
CDS 646..771
/note="alpha-1-globin (AA 101-142)"
/codon_start=646
misc_feature 841..846
/note="put. polyA signal"
polyA_site 861..861
/note="polyA site"
repeat_region 1542..1675
/note="region of 5 x 25bp tandem repeat 1"
repeat_region 3067..3133
/note="region of 7 tandem repeat 2 (9-10bp)"
misc_feature 3139..3744
/note="pseudogene theta-1-globin"
misc_feature 3803..3808
/note="put. polyA signal"
polyA_site 3818..3818
/note="put. polyA site (found by homology to alpha-1)"
BASE COUNT 685 a 1359 c 1310 g 674 t
ORIGIN
1 gcggggccgg gtcccaggca gacgccgcga gggcgccccc agcggtggcg gccgccgccg
61 cgccccgccg cgccggccaa tgagcggggc cccgctgggc gtgcccgcag cacctcgggc
121 ttaaaagcgc cgcgcagtct gggctccgca cacttctggt ccagtccgac tgagaaggaa
181 ccaccatggt gctgtctccc gctgacaaga ccaacatcaa gactgcctgg gaaaagatcg
241 gcagccacgg tggcgagtat ggcgccgagg ccgtggagag gtgaggaccc ccgccccgcc
301 ccgccccgcc cgagcccgcc ggcgccgcgc cccgctcacg gcctcctgtc cccgcaggat
361 gttcttgggc ttccccacca ccaagaccta cttcccccac ttcgacttca cccacggctc
421 tgagcagatc aaagcccacg gcaagaaggt gtccgaagcc ctgaccaagg ccgtgggcca
481 cctggacgac ctgcccggcg ccctgtctac tctcagcgac ctgcacgcgc acaagctgcg
541 ggtggacccg gtgaatttca aggtgagccc gcagcccggc tgggagcgtc gcgggggtcg
601 gcggtccccg accacaccca ccgacgtccg cccctctctc tgcagctcct gtcccactgc
661 ctgctggtga ccctggccaa ccaccacccc agtgaattca cccctgcggt gcacgcctcc
721 ctggacaagt tcctggccaa cgtgagcacc gtgctgacct ccaaatatcg ttaagctgga
781 gcctgggagc cggcctggcc ctccgccccc cccacccccg cagcccaccc ctggtctttg
841 aataaagtct gagtgagtgg ccgacagtgc ccgtggagtt ctcgtgacct gaggtgcagg
901 gccggcctag ggacacgtcc gtgcacgtgc cgaggccccc tgtgcagctg caagggacag
961 gagtgggcaa ccggctggtt ccttccttcc tgcttgcaag tccacgaggg gctgctgaaa
1021 gaacccccca cacacacatg cacacactcg tgccactcgg ctgcctccag cctgggtccc
1081 cggctccccc agatctcggg ggggcactgg ctctccctca gcctcccaaa cgtacccacc
1141 cacccaccca cccacggtgc agacaaaacc ggaggtcgag tgcaggctgc agatcccagc
1201 agcacccggg gacgctcact cctaagaccc ttaggtcgcg cttggggcca gtgaggccca
1261 gtgcccacgt ggccaccctg gggctggcac ccctgccttg aggcagcggg ggcccggggt
1321 ggacagtgcc cgcggcaggc ttccttcctg aagagggagg tttgccgtgc catccagccc
1381 ctggctaaca ccagtgtcct ctcacgccca gtctggggct cctccttgga ggacaccgtg
1441 gcagcccctt gggcacctcg ggggcagtgg gagccgtggg aaggggctgt cttcgctcct
1501 tgagaggaag ggagacaggt gagggtgggg cgggacaggt gcacctgagc aggtgaatgg
1561 gcagactgtg gtgccaccgt agccaggaat ggtggagcac cgccgtagcc gggaatggtg
1621 gggcaccgcc gtagccggga atggtggggc accgccgtag ccgggaatgg tggggcacgg
1681 ctgaacctgc aacactgcct gctgaggagc agccgggcgc aggagcccac ccactggggt
1741 ggagaccccg cttctccaac cagacgccca gctccgtgca gctcaggttg gggagcagtg
1801 gtcatcgatg accaggctgg agactcggct tcttagccgc tggcttgctt cctctgctcc
1861 cgcctgggtt ttgtggtcag tcagcagaag ggcggggggg gggggctcca gtgcccaggt
1921 ctgtgggagg ggtggaggca ctgtgagggg accacttggg ggtgcggctg gcagggcgtg
1981 accccatgtg ctctgtgggt ctcctggagt tccattcagg gacgtggccc ccacaagtgc
2041 cagggctcag cagtgggaga cacactgccc ggaggcggca cacccacatt aggtggacca
2101 cagacgccag tcctctgctg gccccggctg tgtccggctt cccctgaccc ccgcgtgccc
2161 tctcgggtct agggccacct ctgcagcaag cagaggcgct cacttgcctg agaatcacgg
2221 caggccagtc ctgcttggtt taacccagag tggacactga taagtgtcat aagtagaaag
2281 tatagctaat tggcgtcatg ggtatacagc tgctatttag taggttagga atttgtgtgt
2341 gtggctgtct ctgtaattac aattacaacc tcagtgcctt aagtcatcaa cactcagctt
2401 ataatgtctg tgtgcatctt gtttcataat tggataatga atctatattc aaattaatgt
2461 aacgttgatt tctgtccaag aaaaataaat gcaagcattt aaaaaatcta tgactttttt
2521 ttaaaagtcc acatgttgaa taatcccatt tattaaacac acacacacac acacaacaag
2581 caaatccgtg gaaacagaga ggaggttggt gggctggagg aggggctgga ggcactgccc
2641 cggcagtttg ggagtagagg tggggagggt cgcacgcgct ggcttgacag ctcagtgtgg
2701 gagctgcaag gctcggctag gcactcagca ggtgcaggtg ttggccgccc gcaacggaac
2761 tcctgctgcg agccaccccg accggccgcg cggcggccca gcccgggagt cgctgtcacc
2821 atctcgcgca gcgcccgcgc tctgccgggg ttccgcgtcc tgtccaggtc tccctctgcg
2881 cgtgtgcata acatgtgtct ccactgaatg tttcaaatgt gtgttttgct gaaaggcctg
2941 gggttcagag cgagcccgaa agtggcggac cgagactgcg tgcgtgcgcg ggcctccggg
3001 tgcgcgcggc ggcacacgtg tcgggaacgg gcctgcgcca cgcccccaga ggcccgcggg
3061 gacccggccc gccgcgcccg ccgcgcccgc cgcgcccgcc gctgcccgcc gctgcccgcc
3121 gctgcccgcc gctgcgggat ggcgctgtcg gcggcggagc gggcgctgct gcgcgccctg
3181 tggaagaagc tggggagcaa cgtgggcgtc tacgcgaccg aggccctgga gaggtgcgca
3241 ccgggagggc gcccccggcc cgccgcgccc cgcgccgcgg ggcccccaca cgcaccacat
3301 ccccctcctc ccgcagaacc ttggaggcct tcccgcgcac caagatctac ttctcccaca
3361 tggacctgag cccgggctcc gccaggtcag agcccacggc cgcaaggtgg ccgacgcgct
3421 gaccctcgcc gcagaccacc tggacgacct gcccggcgcc ctgtccgctc tgagcgacct
3481 gcacgtgcgc acgctgcgcg tggaccccca ccacttcggg gtgagcgccg ggaaccttcc
3541 accggggagg gggctcccct aggcggggtg ggggaggaga atcgatggac cgcgagcggg
3601 aacgacccct ccctgcagct gctgggccac tgtctgctgg tgaccctcgc ccggcactac
3661 cctggagact tcggccccgc catgcacgcc tcggtggaca aattcctgca ccacgtgatc
3721 tcggcgctga cctccaagta ccgctgaatg gagggtggga ggtcgtggga cgccccgccc
3781 cccgtcgacg ccgtcggctt ggagtaaagc cccggggcag cagcctgaac cgagtgctcc
3841 ctggggattg cgtgtgtggg gatggcctcg ggtccgcaaa ccaaggggct ggcgggtttg
3901 gggcgtccag gtcccaaatt ccaattcctt ggccttggcc aggagggtgg caggcgggag
3961 gtggtcgggg ggctgttgat gcccagtcca ggcccttcgc agtactgctc gcttagtcct
4021 cctgactc
//