[bionet.molbio.genbank.updates] Human UbA52 gene coding for ubiquitin-52 amino acid fusion protein

GenBank-Updates@genbank.bio.net (05/22/91)

LOCUS       HUMUBA52G    4555 bp ds-DNA             PRI       22-MAY-1991
DEFINITION  Human UbA52 gene coding for ubiquitin-52 amino acid fusion protein
ACCESSION   X56997
KEYWORDS    UbA52 gene; ubiquitin-fusion protein.
SOURCE      Homo sapiens DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 4555)
  AUTHORS   Baker,R.T.
  JOURNAL   Unpublished (1990)
  STANDARD  full automatic
REFERENCE   2  (bases 1 to 4555)
  AUTHORS   Baker,R.T. and Board,P.G.
  TITLE     The human ubiquitin-52 amino acid fusion protein gene shares
            several structural features with mammalian ribosomal protein genes
  JOURNAL   Nucleic Acids Res. 19, 1035-1040 (1991)
  STANDARD  full automatic
COMMENT     See <X56997-9>, <Y00361> for related sequences.
            
            From EMBL 26   entry HSUBA52G;  dated 26-MAR-1991.
FEATURES             Location/Qualifiers
     mRNA            join(934..962,2363..2473,2733..2819,3942..4044,4129..4312)
                     /note="in adrenal gland cDNA clone"
                     /gene="UbA52"
     mRNA            join(934..962,2363..2473,2733..2819,3942..4044,4129..4306)
                     /note="in placental gland cDNA clone"
                     /gene="UbA52"
     variation       replace(4226..4226,4226..4226)
                     /note="in adrenal/placental cDNAs"
     CDS             join(2371..2473,2733..2819,3942..4044,4129..4222)
                     /product="ubiquitin-52 amino acid fusion protein"
                     /gene="UbA52"
                     /codon_start=2371
     repeat_unit     114..406
                     /rpt_family="Alu"
     repeat_unit     743..758
                     /rpt_type=direct
     misc_binding    747..752
                     /bound_moiety="Sp1"
     repeat_unit     827..843
                     /rpt_type=direct
     misc_binding    832..837
                     /bound_moiety="Sp1"
     repeat_unit     932..944
                     /rpt_type=inverted
                     /note="pyrimidine tract"
     exon            934..962
                     /number=1
     exon            934..962
     exon            934..962
     prim_transcript 934..4312
                     /gene="UbA52"
     intron          963..2362
                     /number=1
     misc_binding    1037..1042
                     /bound_moiety="Sp1"
     misc_binding    1208..1213
                     /bound_moiety="Sp1"
     repeat_unit     complement(1415..1703)
                     /rpt_family="Alu"
     exon            2363..2473
                     /number=2
     exon            2363..2473
     exon            2363..2473
     intron          2474..2732
                     /number=2
     exon            2733..2819
                     /number=3
     exon            2733..2819
     exon            2733..2819
     exon            2733..2819
     intron          2820..3941
                     /number=3
     repeat_unit     2939..3296
                     /rpt_family="Alu"
     repeat_unit     complement(3373..3623)
                     /rpt_family="Alu"
     exon            3942..4044
                     /number=4
     exon            3942..4044
     exon            3942..4044
     exon            3942..4044
     intron          4045..4128
                     /number=4
     exon            4129..4312
                     /number=5
     exon            4129..4312
     polyA_signal    4280..4285
BASE COUNT      970 a   1178 c   1341 g   1066 t
ORIGIN
        1 ggatccgcac atctcggcct cccaaagtgc aggcgtgagt caccggaccc aggtcccgcc
       61 ctggcacttt ttaaccaccc acaaatctgg atcctacact gaaaagagac actgcagtgg
      121 ctcacgtctg taatcccagc actttgggag gccaaggcgg gcggatcacc tgaggtcgcg
      181 agtttgagac cagcctgacc aacatggaga aaccccgtct ctactaaaaa tacaaaagtg
      241 gccaggcatg gtgtcgcaca cttgtaatcg cagctactcg ggaggctgag gcaggagaat
      301 tgcttgaacc caggaggcgg aggttgcggt gagccgagat cgcgccattg cactacagcc
      361 tgggcaacga gagcgaaact ccgtctcaaa aaaaaaaaaa aaaaaatcct gagtcccgct
      421 tgacaccttt tgtcaggcac caccaccttt ctgggcgaat gcggtagtac cgtctgctct
      481 ccctgctgct gtcctgaaat ccattcaggc acagcggccg agagctttat aataaccgat
      541 tccaggtgtt aggtgctttc ccagccccga ctcctgcgtc ctggacccgc agtcctctgc
      601 ttaatacctt tgctttatta gaaaacattc tcctctactc cgttcagcta ttcgctgagg
      661 gcccgccaac cgccagcggt tgtcaatggc ctagaggcag cggacgcaaa cacggggaga
      721 ggtgcaatcg tctcaagtga ctcggcgggc ggggcccaca accggaagcg ggtgggcgac
      781 cttcacccac gtgcgctgcg gcttcgttcg ccagcatcca agatggcggc agggcggggc
      841 ccaaggcgcg gcgcgaattg tgacgcaggc gtccggcgtg ctccgtcgca agcgctttcg
      901 gcggcgatta ggtggtttcc ggttccgcta tcttcttttt cttcagcgag gcggccgagc
      961 tggttggtgg cggcggtcgt gcgggttcgc gccgggccga gagcgggttg ggggctgcgg
     1021 gaggctgcag gggcctgggc ggcagaagag gcggccctga gctggctcat gcgggccagt
     1081 ctcggcaggg tggctgggca gggctcgcga ggccacggct cggagcccag accggggccc
     1141 aggagcgaac gccgttttgg agaggagcct gcctgctctg cctgccagcg tgaccccacg
     1201 aggcctcggg cgggaagagg tcctcggggc agatccgagt taatgagaga ggggtattga
     1261 gcgtgtagcg ttaactctgc cagtcactgc gtcagtcgct ttggaaatac taaatttctc
     1321 gagctgagtc ttcatacctg gctcctaatc tacgtctgta aggaggagct ggtggtagtg
     1381 tctgcttttt agacttttct ttagactatt tgtatttttt tcagatggag tcttgctctg
     1441 tcgcctaagc tggagttcag tggtgcggtc tcggctcact gcaatctcca cctcccgggc
     1501 tcgagcgatt cttctgcctg agcctcccga gtagctggga ttataggcgc ctggcaccac
     1561 gcccagttga tttttgtagt tttagtagag acggagtttc accatgttag ccaggctcat
     1621 cttgaactct tgacctcaaa tgatccgtct gcctcggcct tccaaagtgc tgggattaca
     1681 ggcatgagcc cctgcgcccg gtcgattctt tgtcttttta agtcaacttt tatatgtgaa
     1741 caatgcttgg caggtggttg gtagatacta agtgatgttc gtggtttggg gtcaaggcaa
     1801 gaagtggggt ctggagagtt ttggtgtaat tgagaaggaa gctaagagtg ttgggtgctc
     1861 cagcttggag ttagagagga gagaggctgc cacaggaaga catgtgtgtt gtaggggatg
     1921 gcttcccatc caggctggca gcaggagcag cctgtgcaga tcaggacctt gctccctgga
     1981 agagggtgga ccgccttcag ggaagatgga tctagcaaga tgatgccaaa gggtacttat
     2041 tccatcagga gatactgacg agtccttccg ccgctaaacc taaggtgaat aaccacagtc
     2101 tgtgttcctg aagagcaccc gtgcggtcag gagggtggag gacatgtgat cttagttcca
     2161 ggacatgttt agactacagg ccagggtgtg tgagaagcct agcagggcca ggcttggagg
     2221 agtgaaagga agacaggtac tggggcagga ccagttggac ttggtgcagg caaagggata
     2281 gcaactgtgg tgtaggcacc tgagcttgtg ctactcaggc atgcattgct caccagtcta
     2341 tcctgccgcc catcctcctc agacgcaaac atgcagatct ttgtgaagac cctcactggc
     2401 aaaaccatca cccttgaggt cgagcccagt gacaccattg agaatgtcaa agccaaaatt
     2461 caagacaagg agggtgagta gggctgggtg tgggggctct ggctgtgaac tgggagtccc
     2521 tctctcgccc aggggagtct cagtcctgtg tgggttgtgc tgactttaga tctgttttgc
     2581 ccttgcttct ccatgtgatc tgaagaacgt ttgttatctt ctacctcagt tggccttttg
     2641 agaaactggg ggtagtgctg gagctcccct gcagaggaca ctgccagtaa tatggtccgc
     2701 agagcctcta actgagcctc cctccccctc aggtatccca cctgaccagc agcgtctgat
     2761 atttgccggc aaacagctgg aggatggccg cactctctca gactacaaca tccagaaagg
     2821 taccggggtt ggggttgctg ggcagggacc caagatcccc aggtcctagg aaaggagcat
     2881 tgatggcctc aggggttggg gagcagttca aatgacttgt gttttgttta aataatggga
     2941 ctgggcacag tggctcatgc ctgtaatccc ggcactttgg gaggcttagg cgggtggatc
     3001 acctgaggtc aggagttcaa gaccagcctg gacaacgtgg tgaaatcccg tttctattaa
     3061 aaatacaaaa atcagctggg tgcagtggct caggcctgta atcccagcac ttcgggaggc
     3121 tgaggcgggc agatcacaag gtcaagagat tgagatcatc atgaccaaca tggtgaaatc
     3181 ccatctctac taaaaataca aaaattagct aggcatggtg gtgcgtgcct gtagtcccag
     3241 ctactcagga ggctgaggaa ggagaattgc ttgaactcgg gagacaaaaa aaaaaagtca
     3301 taatgtgaat ttttttatca ctgcaataag gaaattagtg tcacttgtgg gagcgacaag
     3361 aattcagtgt cctttttttg tgagacagag tcttactctg tcacccaggc tggagtgcag
     3421 tgacgcgatc tcactgtgac ctccgtctcc cgggttcaag cgattcccct gcctcagcct
     3481 cccgagtagc tgggattaca ggcacccgcc accacgccca gctaattttt tttgtatttt
     3541 tagtagagac agggtttcac tacgttggcc aggctggtct cttaaagtgc taggattaca
     3601 ggcgtgagcc atggtgcccc gcctagactt cagtgtctga ccttgcctga accacttaga
     3661 ggtcggcttc catgttagaa acccagatgg atgcctcagt tggcatgtgt cagtctcaga
     3721 ctccccccag ggctcgtggt cagtgctgag atggagattt cctggggcag gctggctggg
     3781 acagtgtatc atccacacgt agaacgacgg cgggggatcc cgacttggtg tccccatcac
     3841 acttgagaaa gcagcagact ataggccctg gagggtcctg cccctgtgac tgaggagcca
     3901 gggctgggct cagtcgccgt ccttctggct gtctcctgca gagtccaccc tgcacctggt
     3961 gttgcgcctg cgaggtggca ttattgagcc ttctctccgc cagcttgccc agaaatacaa
     4021 ctgcgacaag atgatctgcc gcaagtatgt gtgctccgat gcttgggggg ctgtgggggc
     4081 tgccggagtc ggggtatgcc ctcacccacc cctcctgtct ctgtgcaggt gctatgctcg
     4141 ccttcaccct cgtgctgtca actgccgcaa gaagaagtgt ggtcacacca acaacctgcg
     4201 tcccaagaag aaggtcaaat aaggtggttc tttccttgaa gggcagcctc ctgcccaggc
     4261 cccgtggccc tggagcctca ataaagtgtc cctttcattg actggagcag caattggtgt
     4321 cctcatggct gatctgtcca gggaggtggc tgaagagtgg gcatctccct tagggactct
     4381 actcagcact ccattctgtg ccacctgtgg ggtcttctgt cctagattct gtcacatcgg
     4441 cattggtccc tgccctatgc ccctgactct ggatttgtca tctgtaaaac tggagtaaaa
     4501 acctcagtcg tgtaattggt gggactgagg atcagttttg tcattgctgg gatcc
//
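
The CDS location in the feature table above, join(2371..2473,2733..2819,3942..4044,4129..4222), can be checked with a little arithmetic. The following is a minimal sketch, not part of the GenBank record: the helper names parse_join and spliced_length are illustrative only, and the parser assumes simple a..b segments with 1-based inclusive coordinates. It ignores strand, so a complement(...) location would be read as if it were on the forward strand.

```python
import re

def parse_join(location):
    """Return (start, end) pairs (1-based, inclusive) from a simple
    'a..b' or 'join(a..b,c..d,...)' location string.
    Assumption: strand operators such as complement() are ignored."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]

def spliced_length(location):
    """Total number of bases covered by all segments of the location."""
    return sum(end - start + 1 for start, end in parse_join(location))

# CDS location copied from the feature table of this entry.
CDS = "join(2371..2473,2733..2819,3942..4044,4129..4222)"

segments = parse_join(CDS)
total = spliced_length(CDS)

print([end - start + 1 for start, end in segments])  # per-exon coding lengths
print(total, total % 3, total // 3 - 1)               # 387 0 128
```

The four coding segments sum to 387 bases, a whole number of codons (129). Excluding the terminal taa at 4220..4222 leaves 128 residues, which is consistent with a 76-residue ubiquitin fused to the 52 amino acid extension named in the DEFINITION line.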