GenBank-Updates@genbank.bio.net (05/10/91)
LOCUS MUSSURF38 7107 bp ds-DNA ROD 10-MAY-1991
DEFINITION Mouse surfeit locus surfeit 3 gene, exon 8, and surfeit 1 and 2
genes, complete cds.
ACCESSION M14689 M14690 M14691
KEYWORDS B1 repetitive sequence; B2 repetitive sequence; surfeit locus;
surfeit protein.
SEGMENT 8 of 8
SOURCE Mouse (strain BALB/c) cell line 3T3 DNA, clones IDE and H1.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 229 to 7107)
AUTHORS Williams,T.J. and Fried,M.
TITLE The MES-1 murine enhancer element is closely associated with the
heterogeneous 5' ends of two divergent transcription units
JOURNAL Mol. Cell. Biol. 6, 4558-4569 (1986)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 7107)
AUTHORS Fried,M.
JOURNAL Unpublished (1988) ICRF, P.O.Box 123, London, WC2A 3PX.
STANDARD full staff_review
REFERENCE 3 (bases 1 to 286)
AUTHORS Huxley,C., Williams,T. and Fried,M.
TITLE One of the tightly clustered genes of the mouse surfeit locus is a
highly expressed member of a multigene family whose other members
are predominantly processed pseudogenes
JOURNAL Mol. Cell. Biol. 8, 3898-3905 (1988)
STANDARD full staff_entry
COMMENT
[Mol. Cell. Biol. 6, 4558-4569 (1986)] exons and partial introns
only.
Draft entry and computer readable sequence for [Unpublished (1988)
ICRF, P.O.Box 123, London, WC2A 3PX.] kindly provided
by M.Fried, 19-SEP-1988.
FEATURES Location/Qualifiers
prim_transcript <1..160
/note="Surf-3 mRNA and introns"
intron <1..13
/note="Surf-3 intron G"
CDS join(M21455:25..27,M21456:14..134,M21457:14..163,
M21458:14..154,M21459:14..93,M21460:14..144,M14692:14..83,
14..118)
/note="surfeit 3 protein"
/codon_start=25
CDS join(complement(3155..3229),complement(2853..2904),
complement(2497..2630),complement(2333..2415),
complement(1608..1796),complement(1052..1124),
complement(790..952),complement(426..507),
complement(285..354))
/note="surfeit 1 protein"
/codon_start=3229
CDS join(3324..3401,3603..3757,4325..4428,5777..5959,
6096..6262,6631..6717)
/note="surfeit 2 protein"
/codon_start=3324
exon 14..118
/note="surfeit 3 protein, exon 8"
prim_transcript complement(237..3233)
/note="Surf-1 mRNA and introns"
exon complement(285..354)
/note="surfeit 1 protein, exon 9"
intron complement(356..425)
/note="Surf-1, intron H"
exon complement(426..507)
/note="surfeit 1 protein, exon 8"
intron complement(508..798)
/note="Surf-1, intron G"
exon complement(790..952)
/note="surfeit 1 protein, exon 7"
intron complement(953..1051)
/note="Surf-1, intron F"
exon complement(1052..1124)
/note="surfeit 1 protein, exon 6"
intron complement(1125..1607)
/note="Surf-1, intron E"
repeat_region 1383..1565
/note="B2 element"
exon complement(1608..1796)
/note="surfeit 1 protein, exon 5"
intron complement(1797..2332)
/note="Surf-1, intron D"
exon complement(2333..2415)
/note="surfeit 1 protein, exon 4"
intron complement(2416..2496)
/note="Surf-1, intron C"
exon complement(2497..2630)
/note="surfeit 1 protein, exon 3"
intron complement(2630..2852)
/note="Surf-1, intron B"
exon complement(2853..2904)
/note="surfeit 1 protein, exon 2"
intron complement(2905..3154)
/note="Surf-1, intron A"
repeat_region 3005..3051
/note="direct repeat copy AA"
repeat_region 3052..3098
/note="direct repeat copy BB"
prim_transcript 3128..6909
/note="Surf-2 mRNA and introns"
exon complement(3155..3229)
/note="surfeit 1 protein, exon 1"
repeat_region 3239..3248
/note="direct repeat copy A"
repeat_region 3264..3273
/note="direct repeat copy B"
exon 3324..3401
/note="surfeit 2 protein, exon 1"
intron 3402..3602
/note="Surf-2, intron A"
exon 3603..3757
/note="surfeit 2 protein, exon 2"
intron 3758..4324
/note="Surf-2, intron B"
repeat_region 4049..4193
/note="B1 element"
exon 4325..4428
/note="surfeit 2 protein, exon 3"
intron 4429..5776
/note="Surf-2, intron C"
exon 5777..5959
/note="surfeit 2 protein, exon 4"
intron 5960..6095
/note="Surf-2, intron D"
exon 6096..6262
/note="surfeit 2 protein, exon 5"
intron 6263..6630
/note="Surf-2, intron E"
exon 6631..6717
/note="surfeit 2 protein, exon 6"
BASE COUNT 1776 a 1716 c 1785 g 1544 t 286 others
ORIGIN About 262 bp after segment 7.
1 tctgtcattt cagatccgtc gccactgggg aggcaacgtc ctgggtccta agtctgtggc
61 tcgaattgcc aagctggaaa aagcaaaggc taaagaactc gccactaaat tgggttaaat
121 gtacactaaa ttttctgtac ctaaatataa ttacaaaatt atcttgactg cctttggtta
181 tttgggttgg cgcgagtgtg ccctgtaaaa cggtttcaga ctgagccatg gtttttacat
241 agcaagatct ttattttatg gttgttggga acaggctcct tacttcacat gatgggtgtc
301 cgacgtacaa atttttggaa ccacaaatat gatgtggccg cacacagtcc gtacctgtgg
361 ttgaaagaga ttatagcctg tgcggggtgg ggtgggtgga gggggtagca ggcagcagga
421 ctaaccaggt aaggatgtac tgcatgtgct cattgcgcag agtcactctc gtctgtcctc
481 cgatgggccc gccgggggct gtgctgtctg tggagacagg agtcagctct gtcagacccc
541 tagtgcacag agaagtctgc tggctgtttg gaacnnnnnn nnnnnnnnnn nnnnnnnnnn
601 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnccccaa gcaggctgac
781 cacacatacg gaagtctgca tcaatgaaaa tggggtccgc tcctgttatc ttggccatag
841 cttccaggtc tcgataatac cagtgattcc tttctgggct gttctccgga acaaagggct
901 tcctgttttc tgtgagcctc actatgccaa ctaggtctac ttctcccaga acctacaaga
961 gaaggcagga ctgcttttta cgggagtcct gtatggaggc aaggtgactt aactatatac
1021 atttctcggg ctgaagagcc atatgcctta cctggccttt ctgtctggtc tcaggattca
1081 ctttcttcct gggaacaaac cctctattaa ccaggatggt gactctagaa caataataaa
1141 agtcaccctt tgggtgagga agagttccaa tacagatgtc actagtggtt acttcagcat
1201 ttatctactg ggcatataaa tggtttattc tgtggattca gacttcactg ctaagcctta
1261 gtgattcaaa ctggctaaga aaatcgctta atgggctgga gagatggctc agcagttagg
1321 agcactgact gctcttccag agagaggtcc tgagttcaat tcccagcaac cacgtggtgg
1381 ccccacaacc atctgtagtg gaatccgatg ccctcttctt ggtgtgtctg aagacagcta
1441 caaagtgtac tcataaataa aatcttttaa aaaannnnnn nnnnnnnnnn nnnnnnnnnn
1501 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
1561 nnnnnnnnnn gactgagcac aatgaggctc agccatgcac tactcacccc aagtcagagc
1621 aatggaaagg agtaactaca tgggccccac tttcagttga ggatagtctg ccagcatctc
1681 gcgcctctcg gacaggatcc accatggtcc gaggcattat gtacaactct ttagagtggt
1741 caaagtggcc cctgaccttc actggcctgt actccaaatt tttcagttcc attgggctat
1801 aaggagaaaa ggaagggctg tgagtaagcc aggcttgtca atcctttaag cagaaaggct
1861 cagtgaggat catcagacag gttatgtcct ccatagcaac ctttgatgcc aaatagtaat
1921 agaagatggt gataaccaga gcccaggttt tcaaggaaag tgtgggtttc tgtatggcta
1981 ctgtatatac attaacaaaa tgtgagtcca aactccactt gttcccaacc ctgaaggaca
2041 aaactttaaa attcaagagg gcaatacaat aaactttaaa gaccaccacc caccagggcc
2101 acaatgaact gtcaccactt tggaggatcc cttgagacct gaaaagggaa atctgacacc
2161 gaaaataaat cttttctatt tatgttctgt gtccaatttc ataaaggcag tcagcaggca
2221 gtttatcggg aaacagggtt ctcgccaaag gacagcaaat tacaccaaac agaagcaagg
2281 gccccactgc cgccctgtaa gtggaatgaa taaagtgggc agccacactc actctgctgg
2341 tagagggatg ggctcagcca tgactcgaga ctctaattct gcaataagtt tcagcttcca
2401 tttccgacgt tggacctata gaaacagcat gagctaagct aaggtatcag tggagtccga
2461 gcctcggagc cacccaaatt cgacccccat cttcacctgc caagtcccca ggccaaaagc
2521 agtagcaggg attaaaagca ggaaccactg gagaaaagaa tcgtcctctg ctttagcggc
2581 ggctgtttca gcagtagaac tgcaacacct gcgtggccta cagaccatcc ctggaggaag
2641 ttttacaaca ctggcatggg gtcagaagcc ctagaacaaa cagaactgga tggccaattc
2701 ctcgagaggc ccgattatac cccagagaat gtatggcaaa ggtggcaatt aacagcacat
2761 gtgcccaacc ctctcccagt acaatacacg cagtaaacca ctgtgggctg aaaagtttgg
2821 tgacacaccg caccaggacg agcccttctc acctgagcgg acagaaaacc caaagacgct
2881 cctcctgaca gcgcagaact gggcctgcgg ggaccagaca ggccgagggg ctaaatccag
2941 ggaaccccac gcctgccaag tgcaagcacc tgacacggtc caacccagcc ccacacaact
3001 tcgagctaag tcttcttggg cagggtcctc cctggcaccg ctccacccag ggctaagttc
3061 tcttgggcag ggtcctcctc ggccccactc cactccgggc ccggccctcc agatccctct
3121 tcttgagcag gatcctctct gaacccccgc ttacccgtcc cgcgtaggcc cattgcgacc
3181 accgcgtcat ccgtcgcggc agaccagcca aagccatcac agcagccatc tttgagcact
3241 tccgggaccg agaaatctgc ttccttccgg gacggcgctc tgtctcacgt ggtggctgcg
3301 gcgtcagatg ggcatctccg atcatggacg aaccaccctc tgatgtgctt gcattcttgc
3361 gccagcaccc cagcttgaga ctactgccca acacccgcaa ggtcggaaga gaaccgtgac
3421 cggaagagta gggccaagaa ggaacgggat ctgggaggag cggaacctcg gaggaccgac
3481 agtggccaga aaggaggggg acccgggagt attgtgccta gaaagcgtgg gacctgggag
3541 gacagtgcag ggaaggggcg gagctggcaa gaaccacctt ctgaccagat gctcatccgc
3601 aggttcgctg ctccctaact ggccatgagc tgccctgccg tctgcccgag ctccaggaat
3661 acacccgcgg caagaagtac caacggctgt caagttcctt ctctaacttc gattacgcag
3721 ctttcgagcc acacattgtg cccagcacaa agaatcggta cgtagtttgg ccggccagcg
3781 cctaggagca gagcctgtgc cccctgttct gagcttgtag gagcctccca ctgtgacact
3841 tatcaggaca gtgcttgtgg cccttgcatg tagttccctg aagccaggct gggcagtgcc
3901 agtttcactg tttggggatc ctgtctatgg ttgtgtggtt gagtatatcc tcatatagca
3961 ctccctgagc tgtctgaccg ccaagcatgc agtgtgaaac tgcaatttct tgggtaacaa
4021 tctgggtcct tgatcccaga tactgagtgg aattgtttgt aagcctgtgt agctgggttg
4081 gcctggaact cactatgtag gccagactga ccttaaagcc tgcctctgac tctgcccctc
4141 tgcctctgcc acctgagtgc tggaattaaa gtccaccatg cctttttcaa aatccagaca
4201 tgagactaaa tgcagagcat cccagcagat ggatcatttg tttgcagcat gcagccatgg
4261 tggagcccac aagactgggc ccttggtgtg catgcccctc catacccacc tgtttgcctt
4321 ttaggcacca actgttctgc aaactcaccc tgaggcacat caataagtcc ccagaacacg
4381 tgctgaggca cacccagggc cggaggtatc agagagcact tcatcaatgt aagtcacccc
4441 ccagaagctc aggcctttgg atgcatgctc cctgccccca aaaagccaga aaggtgcctt
4501 ggccatcact gatcctttat tgatgcactt gcctgatctt gcacagtgag aagcaaagcc
4561 agaagtaggt gggaaggagt gcggtggagg tctgatattc catagcaggg acaatcctag
4621 ggtagctccc aggctgaggc tgcagtattc tagttatagg acaggtccat agtgcatctg
4681 atactgcagg agagccttga agagctgttc tcaggagcag gattggtgca aacagtcctg
4741 caggctccag agggtgaggg gtcttgcagc taaggggttg gtggcttgag caggaaaggg
4801 gacttgcaga cagcctccag tattgttggg tatagccagg ggtgggaagg catcccaagt
4861 cattgagagt agagactcct ggggttaggg acagagagag gatggggcca gcttgagcat
4921 gagggaggtg aaaacctccc atccatgcta ccacaggaag atgagggtaa gaaagggact
4981 gggactaggt aacttggatg gaggaatgaa gcagggagaa tggagggttg gggttcgcag
5041 aggctgggga tgagtgggag gcttcaggga ctgcatcgcc tgtcctgccc taacaggcag
5101 acttcagtca cagggtggga aggtaaggaa tctgtcagtt gtctgctggt gcagagttgg
5161 cacctgggct gcactgaccg gtggcaagct aaggtttggc tgttgtacct ttgttagctt
5221 tgtcatctgc tacattttca cagggagggc tgtacaggaa tccccatgat tataaagatg
5281 tatcaaagag ctgagacaga gccaaaacac agtgaggccg gttagatggc aggctggtta
5341 ccgtagctaa ttggcctgct gagagcactg ggaggagacc acagtggtcc tagaaataga
5401 tgtcaaagat gaagcaggga agactggaga tagcagagca gttggttcta tcagatgccc
5461 agtcccaatt agactgttac tcatgagcaa tgagctggta tccctcacct atggagatgg
5521 tgctggccca agaagttcag actcctcagt acctgaggac agcatgtgac cctagtaagt
5581 gccttccctg aagcctcatt cccgcgccaa gttgaagtga agccaggcag tttcccctga
5641 ggtccaaccc tttggctgcc cacccctatc tggtaggata aaggaacaga accatggcac
5701 aatacaacac aatacaaccc atagcttagc tgtgcaccaa gagggtacca gtgtaatgag
5761 cttgctagtc tcctagatga agagtgtcag aaacaaggtg tggaatatgt ccctgcctgc
5821 cttctacaca agaggaagaa gagagaggac cagacgaaca gtgatgaact cccaggccag
5881 agaacaggtt tctgggagcc agcttccagt gacgaggaag acgccttgag tgacgacagc
5941 atgacagacc tatacccacg taagcagacc cagtccaatc cctgcccctg tctcctgcta
6001 ggtaccttgt gatcctgctg tgagccactt ttcctaagaa agggagggat gcttttacct
6061 gtgacacagc tcagccaagt gtttgttcct tccagctgag ctgttcacaa agagagaact
6121 aggcaagcct aagaacgatg acactcctga agactttctg acagaccaac aggatgagaa
6181 gccggagcat tcagaagaga agagctttag agagagggaa gaggccagag tgggccacaa
6241 gaggggtcgt aaactgagga aggtgagtgg tgggctgcag agtgctgagg gtagggggct
6301 actcagcaaa ttctgcaagt gcacttgtcg ggtagcagcc taaaatatgg aagccaaggc
6361 cactttagaa ggtctgcagg agtaagtcct gactgctcag gaacttcact gctgggtact
6421 ccctctctat gggacaggac tgagtattct gtcagagagg agcaaggtag acaaagtctc
6481 tagtcacctg tgttgccctc tcccaggccc cacactgacc cagcaataga agtgttgaga
6541 aggcagactt aaggctcttc cggcttagac acacagtgcc tttccctgtg aacctgcaga
6601 ggccctggtc agcattttct gctttcacag aagcagctca cctccttgac caagaagttc
6661 aagagctatc atcacaagcc caagaacttc agttccttta agcagctggg gagatgaagc
6721 ctgcaaaaaa agcctttgga tgtccttgga aatggccact gatatcctga cattggctgt
6781 tgtccagagt gggcctggag gttctgttca gaagatggca gcgtcgggga cttgcatggc
6841 tttctctgca gtgttgagtg actcttgctg tagctcactg tagcatcccg aagcctagcc
6901 aggctcccag ctctggctgc tcaccatata ggaccacact atataccaca aatgccatta
6961 tttattttga tgtttccaaa gatcaaaaca ttttcaaaca caactaagat ataaaataca
7021 gcataaaaat gaggtttata cctattccca cataaagcta aagcatcttc agaaaacttt
7081 tgccaaacca atatgtgtat cattgtt
//