[bionet.molbio.genbank.updates] Rabbit alpha-1-globin gene to theta-1-globin pseudogene region

GenBank-Updates@genbank.bio.net (05/22/91)

LOCUS       RABATGL1     4028 bp ds-DNA             MAM       22-MAY-1991
DEFINITION  Rabbit alpha-1-globin gene to theta-1-globin pseudogene region
ACCESSION   X04751
KEYWORDS    alpha-1-globin; alpha-globin; globin; pseudogene;
            repetitive sequence; tandem repeat; theta-1-globin; theta-globin.
SOURCE      Oryctolagus cuniculus DNA.
  ORGANISM  Oryctolagus cuniculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE   1  (bases 1 to 4028)
  AUTHORS   Cheng,J.-F., Raid,L. and Hardison,R.C.
  TITLE     Isolation and nucleotide sequence of the rabbit globin gene cluster
            psi-zeta-alpha-1-psi-alpha: Absence of a pair of alpha-globin genes
            evolving in concert
  JOURNAL   J. Biol. Chem. 261, 839-848 (1986)
  STANDARD  full automatic
REFERENCE   2  (bases 1 to 4028)
  AUTHORS   Hardison,R.C.
  JOURNAL   Unpublished (1987)
  STANDARD  full automatic
COMMENT     SWISS-PROT; P01948; HBA$RABIT.
            
            Submitted data [2] include some corrections to published seq. [1].
            Referring to the authors the sequence from pos. 50 to 70 may not be
            completely accurate due to reading problems of the sequencing gels.
            Theta-1 pseudogene was formerly called psi alpha. Data kindly
            reviewed (15-Jun-1987) by Hardison R.C.
            
            From EMBL 26   entry OCATGL1;  dated 22-APR-1990.
FEATURES             Location/Qualifiers
     precursor_RNA   150..861
                     /note="primary transcript od alpha-1-globin"
     mRNA            150..280
                     /note="exon 1"
     CDS             186..280
                     /note="alpha-1-globin (AA 1-32) (280 is 2nd base in
                     codon)"
                     /codon_start=186
     intron          281..357
                     /note="intron I"
     mRNA            358..562
                     /note="exon 2"
     CDS             358..562
                     /note="alpha-1-globin (AA 33-100) (358 is 3rd base in
                     codon)"
                     /codon_start=358
     intron          563..645
                     /note="intron II"
     mRNA            646..861
                     /note="exon 3"
     CDS             646..771
                     /note="alpha-1-globin (AA 101-142)"
                     /codon_start=646
     misc_feature    841..846
                     /note="put. polyA signal"
     polyA_site      861..861
                     /note="polyA site"
     repeat_region   1542..1675
                     /note="region of 5 x 25bp tandem repeat 1"
     repeat_region   3067..3133
                     /note="region of 7 tandem repeat 2 (9-10bp)"
     misc_feature    3139..3744
                     /note="pseudogene theta-1-globin"
     misc_feature    3803..3808
                     /note="put. polyA signal"
     polyA_site      3818..3818
                     /note="put. polyA site (found by homology to alpha-1)"
BASE COUNT      685 a   1359 c   1310 g    674 t
ORIGIN
        1 gcggggccgg gtcccaggca gacgccgcga gggcgccccc agcggtggcg gccgccgccg
       61 cgccccgccg cgccggccaa tgagcggggc cccgctgggc gtgcccgcag cacctcgggc
      121 ttaaaagcgc cgcgcagtct gggctccgca cacttctggt ccagtccgac tgagaaggaa
      181 ccaccatggt gctgtctccc gctgacaaga ccaacatcaa gactgcctgg gaaaagatcg
      241 gcagccacgg tggcgagtat ggcgccgagg ccgtggagag gtgaggaccc ccgccccgcc
      301 ccgccccgcc cgagcccgcc ggcgccgcgc cccgctcacg gcctcctgtc cccgcaggat
      361 gttcttgggc ttccccacca ccaagaccta cttcccccac ttcgacttca cccacggctc
      421 tgagcagatc aaagcccacg gcaagaaggt gtccgaagcc ctgaccaagg ccgtgggcca
      481 cctggacgac ctgcccggcg ccctgtctac tctcagcgac ctgcacgcgc acaagctgcg
      541 ggtggacccg gtgaatttca aggtgagccc gcagcccggc tgggagcgtc gcgggggtcg
      601 gcggtccccg accacaccca ccgacgtccg cccctctctc tgcagctcct gtcccactgc
      661 ctgctggtga ccctggccaa ccaccacccc agtgaattca cccctgcggt gcacgcctcc
      721 ctggacaagt tcctggccaa cgtgagcacc gtgctgacct ccaaatatcg ttaagctgga
      781 gcctgggagc cggcctggcc ctccgccccc cccacccccg cagcccaccc ctggtctttg
      841 aataaagtct gagtgagtgg ccgacagtgc ccgtggagtt ctcgtgacct gaggtgcagg
      901 gccggcctag ggacacgtcc gtgcacgtgc cgaggccccc tgtgcagctg caagggacag
      961 gagtgggcaa ccggctggtt ccttccttcc tgcttgcaag tccacgaggg gctgctgaaa
     1021 gaacccccca cacacacatg cacacactcg tgccactcgg ctgcctccag cctgggtccc
     1081 cggctccccc agatctcggg ggggcactgg ctctccctca gcctcccaaa cgtacccacc
     1141 cacccaccca cccacggtgc agacaaaacc ggaggtcgag tgcaggctgc agatcccagc
     1201 agcacccggg gacgctcact cctaagaccc ttaggtcgcg cttggggcca gtgaggccca
     1261 gtgcccacgt ggccaccctg gggctggcac ccctgccttg aggcagcggg ggcccggggt
     1321 ggacagtgcc cgcggcaggc ttccttcctg aagagggagg tttgccgtgc catccagccc
     1381 ctggctaaca ccagtgtcct ctcacgccca gtctggggct cctccttgga ggacaccgtg
     1441 gcagcccctt gggcacctcg ggggcagtgg gagccgtggg aaggggctgt cttcgctcct
     1501 tgagaggaag ggagacaggt gagggtgggg cgggacaggt gcacctgagc aggtgaatgg
     1561 gcagactgtg gtgccaccgt agccaggaat ggtggagcac cgccgtagcc gggaatggtg
     1621 gggcaccgcc gtagccggga atggtggggc accgccgtag ccgggaatgg tggggcacgg
     1681 ctgaacctgc aacactgcct gctgaggagc agccgggcgc aggagcccac ccactggggt
     1741 ggagaccccg cttctccaac cagacgccca gctccgtgca gctcaggttg gggagcagtg
     1801 gtcatcgatg accaggctgg agactcggct tcttagccgc tggcttgctt cctctgctcc
     1861 cgcctgggtt ttgtggtcag tcagcagaag ggcggggggg gggggctcca gtgcccaggt
     1921 ctgtgggagg ggtggaggca ctgtgagggg accacttggg ggtgcggctg gcagggcgtg
     1981 accccatgtg ctctgtgggt ctcctggagt tccattcagg gacgtggccc ccacaagtgc
     2041 cagggctcag cagtgggaga cacactgccc ggaggcggca cacccacatt aggtggacca
     2101 cagacgccag tcctctgctg gccccggctg tgtccggctt cccctgaccc ccgcgtgccc
     2161 tctcgggtct agggccacct ctgcagcaag cagaggcgct cacttgcctg agaatcacgg
     2221 caggccagtc ctgcttggtt taacccagag tggacactga taagtgtcat aagtagaaag
     2281 tatagctaat tggcgtcatg ggtatacagc tgctatttag taggttagga atttgtgtgt
     2341 gtggctgtct ctgtaattac aattacaacc tcagtgcctt aagtcatcaa cactcagctt
     2401 ataatgtctg tgtgcatctt gtttcataat tggataatga atctatattc aaattaatgt
     2461 aacgttgatt tctgtccaag aaaaataaat gcaagcattt aaaaaatcta tgactttttt
     2521 ttaaaagtcc acatgttgaa taatcccatt tattaaacac acacacacac acacaacaag
     2581 caaatccgtg gaaacagaga ggaggttggt gggctggagg aggggctgga ggcactgccc
     2641 cggcagtttg ggagtagagg tggggagggt cgcacgcgct ggcttgacag ctcagtgtgg
     2701 gagctgcaag gctcggctag gcactcagca ggtgcaggtg ttggccgccc gcaacggaac
     2761 tcctgctgcg agccaccccg accggccgcg cggcggccca gcccgggagt cgctgtcacc
     2821 atctcgcgca gcgcccgcgc tctgccgggg ttccgcgtcc tgtccaggtc tccctctgcg
     2881 cgtgtgcata acatgtgtct ccactgaatg tttcaaatgt gtgttttgct gaaaggcctg
     2941 gggttcagag cgagcccgaa agtggcggac cgagactgcg tgcgtgcgcg ggcctccggg
     3001 tgcgcgcggc ggcacacgtg tcgggaacgg gcctgcgcca cgcccccaga ggcccgcggg
     3061 gacccggccc gccgcgcccg ccgcgcccgc cgcgcccgcc gctgcccgcc gctgcccgcc
     3121 gctgcccgcc gctgcgggat ggcgctgtcg gcggcggagc gggcgctgct gcgcgccctg
     3181 tggaagaagc tggggagcaa cgtgggcgtc tacgcgaccg aggccctgga gaggtgcgca
     3241 ccgggagggc gcccccggcc cgccgcgccc cgcgccgcgg ggcccccaca cgcaccacat
     3301 ccccctcctc ccgcagaacc ttggaggcct tcccgcgcac caagatctac ttctcccaca
     3361 tggacctgag cccgggctcc gccaggtcag agcccacggc cgcaaggtgg ccgacgcgct
     3421 gaccctcgcc gcagaccacc tggacgacct gcccggcgcc ctgtccgctc tgagcgacct
     3481 gcacgtgcgc acgctgcgcg tggaccccca ccacttcggg gtgagcgccg ggaaccttcc
     3541 accggggagg gggctcccct aggcggggtg ggggaggaga atcgatggac cgcgagcggg
     3601 aacgacccct ccctgcagct gctgggccac tgtctgctgg tgaccctcgc ccggcactac
     3661 cctggagact tcggccccgc catgcacgcc tcggtggaca aattcctgca ccacgtgatc
     3721 tcggcgctga cctccaagta ccgctgaatg gagggtggga ggtcgtggga cgccccgccc
     3781 cccgtcgacg ccgtcggctt ggagtaaagc cccggggcag cagcctgaac cgagtgctcc
     3841 ctggggattg cgtgtgtggg gatggcctcg ggtccgcaaa ccaaggggct ggcgggtttg
     3901 gggcgtccag gtcccaaatt ccaattcctt ggccttggcc aggagggtgg caggcgggag
     3961 gtggtcgggg ggctgttgat gcccagtcca ggcccttcgc agtactgctc gcttagtcct
     4021 cctgactc
//