GenBank-Updates@genbank.bio.net (05/22/91)
LOCUS HUMUBA52G 4555 bp ds-DNA PRI 22-MAY-1991
DEFINITION Human UbA52 gene coding for ubiquitin-52 amino acid fusion protein
ACCESSION X56997
KEYWORDS UbA52 gene; ubiquitin-fusion protein.
SOURCE Homo sapiens DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 4555)
AUTHORS Baker,R.T.
JOURNAL Unpublished (1990)
STANDARD full automatic
REFERENCE 2 (bases 1 to 4555)
AUTHORS Baker,R.T. and Board,P.G.
TITLE The human ubiquitin-52 amino acid fusion protein gene shares
several structural features with mammalian ribosomal protein genes
JOURNAL Nucleic Acids Res. 19, 1035-1040 (1991)
STANDARD full automatic
COMMENT See <X56997-9>, <Y00361> for related sequences.
From EMBL 26 entry HSUBA52G; dated 26-MAR-1991.
FEATURES Location/Qualifiers
mRNA join(934..962,2363..2473,2733..2819,3942..4044,4129..4312)
/note="in adrenal gland cDNA clone"
/gene="UbA52"
mRNA join(934..962,2363..2473,2733..2819,3942..4044,4129..4306)
/note="in placental gland cDNA clone"
/gene="UbA52"
variation replace(4226..4226,4226..4226)
/note="in adrenal/placental cDNAs"
CDS join(2371..2473,2733..2819,3942..4044,4129..4222)
/product="ubiquitin-52 amino acid fusion protein"
/gene="UbA52"
/codon_start=2371
repeat_unit 114..406
/rpt_family="Alu"
repeat_unit 743..758
/rpt_type=direct
misc_binding 747..752
/bound_moiety="Sp1"
repeat_unit 827..843
/rpt_type=direct
misc_binding 832..837
/bound_moiety="Sp1"
repeat_unit 932..944
/rpt_type=inverted
/note="pyrimidine tract"
exon 934..962
/number=1
prim_transcript 934..4312
/gene="UbA52"
intron 963..2362
/number=1
misc_binding 1037..1042
/bound_moiety="Sp1"
misc_binding 1208..1213
/bound_moiety="Sp1"
repeat_unit complement(1415..1703)
/rpt_family="Alu"
exon 2363..2473
/number=2
intron 2474..2732
/number=2
exon 2733..2819
/number=3
intron 2820..3941
/number=3
repeat_unit 2939..3296
/rpt_family="Alu"
repeat_unit complement(3373..3623)
/rpt_family="Alu"
exon 3942..4044
/number=4
intron 4045..4128
/number=4
exon 4129..4312
/number=5
polyA_signal 4280..4285
BASE COUNT 970 a 1178 c 1341 g 1066 t
ORIGIN
1 ggatccgcac atctcggcct cccaaagtgc aggcgtgagt caccggaccc aggtcccgcc
61 ctggcacttt ttaaccaccc acaaatctgg atcctacact gaaaagagac actgcagtgg
121 ctcacgtctg taatcccagc actttgggag gccaaggcgg gcggatcacc tgaggtcgcg
181 agtttgagac cagcctgacc aacatggaga aaccccgtct ctactaaaaa tacaaaagtg
241 gccaggcatg gtgtcgcaca cttgtaatcg cagctactcg ggaggctgag gcaggagaat
301 tgcttgaacc caggaggcgg aggttgcggt gagccgagat cgcgccattg cactacagcc
361 tgggcaacga gagcgaaact ccgtctcaaa aaaaaaaaaa aaaaaatcct gagtcccgct
421 tgacaccttt tgtcaggcac caccaccttt ctgggcgaat gcggtagtac cgtctgctct
481 ccctgctgct gtcctgaaat ccattcaggc acagcggccg agagctttat aataaccgat
541 tccaggtgtt aggtgctttc ccagccccga ctcctgcgtc ctggacccgc agtcctctgc
601 ttaatacctt tgctttatta gaaaacattc tcctctactc cgttcagcta ttcgctgagg
661 gcccgccaac cgccagcggt tgtcaatggc ctagaggcag cggacgcaaa cacggggaga
721 ggtgcaatcg tctcaagtga ctcggcgggc ggggcccaca accggaagcg ggtgggcgac
781 cttcacccac gtgcgctgcg gcttcgttcg ccagcatcca agatggcggc agggcggggc
841 ccaaggcgcg gcgcgaattg tgacgcaggc gtccggcgtg ctccgtcgca agcgctttcg
901 gcggcgatta ggtggtttcc ggttccgcta tcttcttttt cttcagcgag gcggccgagc
961 tggttggtgg cggcggtcgt gcgggttcgc gccgggccga gagcgggttg ggggctgcgg
1021 gaggctgcag gggcctgggc ggcagaagag gcggccctga gctggctcat gcgggccagt
1081 ctcggcaggg tggctgggca gggctcgcga ggccacggct cggagcccag accggggccc
1141 aggagcgaac gccgttttgg agaggagcct gcctgctctg cctgccagcg tgaccccacg
1201 aggcctcggg cgggaagagg tcctcggggc agatccgagt taatgagaga ggggtattga
1261 gcgtgtagcg ttaactctgc cagtcactgc gtcagtcgct ttggaaatac taaatttctc
1321 gagctgagtc ttcatacctg gctcctaatc tacgtctgta aggaggagct ggtggtagtg
1381 tctgcttttt agacttttct ttagactatt tgtatttttt tcagatggag tcttgctctg
1441 tcgcctaagc tggagttcag tggtgcggtc tcggctcact gcaatctcca cctcccgggc
1501 tcgagcgatt cttctgcctg agcctcccga gtagctggga ttataggcgc ctggcaccac
1561 gcccagttga tttttgtagt tttagtagag acggagtttc accatgttag ccaggctcat
1621 cttgaactct tgacctcaaa tgatccgtct gcctcggcct tccaaagtgc tgggattaca
1681 ggcatgagcc cctgcgcccg gtcgattctt tgtcttttta agtcaacttt tatatgtgaa
1741 caatgcttgg caggtggttg gtagatacta agtgatgttc gtggtttggg gtcaaggcaa
1801 gaagtggggt ctggagagtt ttggtgtaat tgagaaggaa gctaagagtg ttgggtgctc
1861 cagcttggag ttagagagga gagaggctgc cacaggaaga catgtgtgtt gtaggggatg
1921 gcttcccatc caggctggca gcaggagcag cctgtgcaga tcaggacctt gctccctgga
1981 agagggtgga ccgccttcag ggaagatgga tctagcaaga tgatgccaaa gggtacttat
2041 tccatcagga gatactgacg agtccttccg ccgctaaacc taaggtgaat aaccacagtc
2101 tgtgttcctg aagagcaccc gtgcggtcag gagggtggag gacatgtgat cttagttcca
2161 ggacatgttt agactacagg ccagggtgtg tgagaagcct agcagggcca ggcttggagg
2221 agtgaaagga agacaggtac tggggcagga ccagttggac ttggtgcagg caaagggata
2281 gcaactgtgg tgtaggcacc tgagcttgtg ctactcaggc atgcattgct caccagtcta
2341 tcctgccgcc catcctcctc agacgcaaac atgcagatct ttgtgaagac cctcactggc
2401 aaaaccatca cccttgaggt cgagcccagt gacaccattg agaatgtcaa agccaaaatt
2461 caagacaagg agggtgagta gggctgggtg tgggggctct ggctgtgaac tgggagtccc
2521 tctctcgccc aggggagtct cagtcctgtg tgggttgtgc tgactttaga tctgttttgc
2581 ccttgcttct ccatgtgatc tgaagaacgt ttgttatctt ctacctcagt tggccttttg
2641 agaaactggg ggtagtgctg gagctcccct gcagaggaca ctgccagtaa tatggtccgc
2701 agagcctcta actgagcctc cctccccctc aggtatccca cctgaccagc agcgtctgat
2761 atttgccggc aaacagctgg aggatggccg cactctctca gactacaaca tccagaaagg
2821 taccggggtt ggggttgctg ggcagggacc caagatcccc aggtcctagg aaaggagcat
2881 tgatggcctc aggggttggg gagcagttca aatgacttgt gttttgttta aataatggga
2941 ctgggcacag tggctcatgc ctgtaatccc ggcactttgg gaggcttagg cgggtggatc
3001 acctgaggtc aggagttcaa gaccagcctg gacaacgtgg tgaaatcccg tttctattaa
3061 aaatacaaaa atcagctggg tgcagtggct caggcctgta atcccagcac ttcgggaggc
3121 tgaggcgggc agatcacaag gtcaagagat tgagatcatc atgaccaaca tggtgaaatc
3181 ccatctctac taaaaataca aaaattagct aggcatggtg gtgcgtgcct gtagtcccag
3241 ctactcagga ggctgaggaa ggagaattgc ttgaactcgg gagacaaaaa aaaaaagtca
3301 taatgtgaat ttttttatca ctgcaataag gaaattagtg tcacttgtgg gagcgacaag
3361 aattcagtgt cctttttttg tgagacagag tcttactctg tcacccaggc tggagtgcag
3421 tgacgcgatc tcactgtgac ctccgtctcc cgggttcaag cgattcccct gcctcagcct
3481 cccgagtagc tgggattaca ggcacccgcc accacgccca gctaattttt tttgtatttt
3541 tagtagagac agggtttcac tacgttggcc aggctggtct cttaaagtgc taggattaca
3601 ggcgtgagcc atggtgcccc gcctagactt cagtgtctga ccttgcctga accacttaga
3661 ggtcggcttc catgttagaa acccagatgg atgcctcagt tggcatgtgt cagtctcaga
3721 ctccccccag ggctcgtggt cagtgctgag atggagattt cctggggcag gctggctggg
3781 acagtgtatc atccacacgt agaacgacgg cgggggatcc cgacttggtg tccccatcac
3841 acttgagaaa gcagcagact ataggccctg gagggtcctg cccctgtgac tgaggagcca
3901 gggctgggct cagtcgccgt ccttctggct gtctcctgca gagtccaccc tgcacctggt
3961 gttgcgcctg cgaggtggca ttattgagcc ttctctccgc cagcttgccc agaaatacaa
4021 ctgcgacaag atgatctgcc gcaagtatgt gtgctccgat gcttgggggg ctgtgggggc
4081 tgccggagtc ggggtatgcc ctcacccacc cctcctgtct ctgtgcaggt gctatgctcg
4141 ccttcaccct cgtgctgtca actgccgcaa gaagaagtgt ggtcacacca acaacctgcg
4201 tcccaagaag aaggtcaaat aaggtggttc tttccttgaa gggcagcctc ctgcccaggc
4261 cccgtggccc tggagcctca ataaagtgtc cctttcattg actggagcag caattggtgt
4321 cctcatggct gatctgtcca gggaggtggc tgaagagtgg gcatctccct tagggactct
4381 actcagcact ccattctgtg ccacctgtgg ggtcttctgt cctagattct gtcacatcgg
4441 cattggtccc tgccctatgc ccctgactct ggatttgtca tctgtaaaac tggagtaaaa
4501 acctcagtcg tgtaattggt gggactgagg atcagttttg tcattgctgg gatcc
//
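
The coordinates in the FEATURES table above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the GenBank entry: the helper names (parse_join, total_length, tiles_exactly, slice_origin) are hypothetical, and all values are copied by hand from the record. It assumes simple join(a..b,...) locations on the forward strand and two ORIGIN lines pasted in verbatim.

```python
import re

# Values copied by hand from the record above.
LOCUS_LENGTH = 4555
BASE_COUNT = {"a": 970, "c": 1178, "g": 1341, "t": 1066}
CDS = "join(2371..2473,2733..2819,3942..4044,4129..4222)"
EXONS = [(934, 962), (2363, 2473), (2733, 2819), (3942, 4044), (4129, 4312)]
INTRONS = [(963, 2362), (2474, 2732), (2820, 3941), (4045, 4128)]

# Two ORIGIN lines, verbatim; the leading number is the position of the
# first base on that line.
ORIGIN_2341 = "2341 tcctgccgcc catcctcctc agacgcaaac atgcagatct ttgtgaagac cctcactggc"
ORIGIN_4261 = "4261 cccgtggccc tggagcctca ataaagtgtc cctttcattg actggagcag caattggtgt"


def parse_join(location):
    """Return (start, end) pairs from a simple 'join(a..b,c..d)' location."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def total_length(segments):
    """Sum the lengths of 1-based, inclusive (start, end) segments."""
    return sum(end - start + 1 for start, end in segments)


def tiles_exactly(exons, introns):
    """True if exons and introns abut end to start with no gap or overlap."""
    pieces = sorted(exons + introns)
    return all(a[1] + 1 == b[0] for a, b in zip(pieces, pieces[1:]))


def slice_origin(line, start, end):
    """Return bases start..end (1-based, inclusive) from one ORIGIN line."""
    number, *blocks = line.split()
    offset = int(number)
    bases = "".join(blocks)
    if start < offset or end >= offset + len(bases):
        raise ValueError("requested range is not on this line")
    return bases[start - offset:end - offset + 1]


cds_length = total_length(parse_join(CDS))
codons = cds_length // 3
base_total = sum(BASE_COUNT.values())
start_codon = slice_origin(ORIGIN_2341, 2371, 2373)
polya_signal = slice_origin(ORIGIN_4261, 4280, 4285)

print("CDS length (bp):", cds_length, "codons:", codons)
print("BASE COUNT total matches LOCUS length:", base_total == LOCUS_LENGTH)
print("exons and introns tile the transcript:", tiles_exactly(EXONS, INTRONS))
print("bases 2371..2373:", start_codon)
print("bases 4280..4285:", polya_signal)
```

With the values above, the CDS segments add up to 387 bp, that is 129 codons: 128 amino acids plus a stop codon. This is consistent with a 76-residue ubiquitin fused to the 52-residue extension named in the DEFINITION line. The four base counts sum to 4555, the exon and intron features abut without gaps from 934 to 4312, bases 2371..2373 read atg, and the polyA_signal at 4280..4285 reads aataaa.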