GenBank-Updates@genbank.bio.net (05/10/91)
LOCUS       MUSSURF38    7107 bp ds-DNA             ROD       10-MAY-1991
DEFINITION  Mouse surfeit locus surfeit 3 gene, exon 8, and surfeit 1 and 2
            genes, complete cds.
ACCESSION   M14689 M14690 M14691
KEYWORDS    B1 repetitive sequence; B2 repetitive sequence; surfeit locus;
            surfeit protein.
SEGMENT     8 of 8
SOURCE      Mouse (strain BALB/c) cell line 3T3 DNA, clones IDE and H1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 229 to 7107)
  AUTHORS   Williams,T.J. and Fried,M.
  TITLE     The MES-1 murine enhancer element is closely associated with the
            heterogeneous 5' ends of two divergent transcription units
  JOURNAL   Mol. Cell. Biol. 6, 4558-4569 (1986)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 7107)
  AUTHORS   Fried,M.
  JOURNAL   Unpublished (1988) ICRF, P.O.Box 123, London, WC2A 3PX.
  STANDARD  full staff_review
REFERENCE   3  (bases 1 to 286)
  AUTHORS   Huxley,C., Williams,T. and Fried,M.
  TITLE     One of the tightly clustered genes of the mouse surfeit locus is a
            highly expressed member of a multigene family whose other members
            are predominantly processed pseudogenes
  JOURNAL   Mol. Cell. Biol. 8, 3898-3905 (1988)
  STANDARD  full staff_entry
COMMENT     [Mol. Cell. Biol. 6, 4558-4569 (1986)] exons and partial introns
            only.
            Draft entry and computer readable sequence for [Unpublished
            (1988) ICRF, P.O.Box 123, London, WC2A 3PX.] kindly provided by
            M.Fried, 19-SEP-1988.
FEATURES             Location/Qualifiers
     prim_transcript <1..160
                     /note="Surf-3 mRNA and introns"
     intron          <1..13
                     /note="Surf-3 intron G"
     CDS             join(M21455:25..27,M21456:14..134,M21457:14..163,
                     M21458:14..154,M21459:14..93,M21460:14..144,M14692:14..83,
                     14..118)
                     /note="surfeit 3 protein"
                     /codon_start=25
     CDS             join(complement(3155..3229),complement(2853..2904),
                     complement(2497..2630),complement(2333..2415),
                     complement(1608..1796),complement(1052..1124),
                     complement(790..952),complement(426..507),
                     complement(285..354))
                     /note="surfeit 1 protein"
                     /codon_start=3229
     CDS             join(3324..3401,3603..3757,4325..4428,5777..5959,
                     6096..6262,6631..6717)
                     /note="surfeit 2 protein"
                     /codon_start=3324
     exon            14..118
                     /note="surfeit 3 protein, exon 8"
     prim_transcript complement(237..3233)
                     /note="Surf-1 mRNA and introns"
     exon            complement(285..354)
                     /note="surfeit 1 protein, exon 9"
     intron          complement(356..425)
                     /note="Surf-1, intron H"
     exon            complement(426..507)
                     /note="surfeit 1 protein, exon 8"
     intron          complement(508..798)
                     /note="Surf-1, intron G"
     exon            complement(790..952)
                     /note="surfeit 1 protein, exon 7"
     intron          complement(953..1051)
                     /note="Surf-1, intron F"
     exon            complement(1052..1124)
                     /note="surfeit 1 protein, exon 6"
     intron          complement(1125..1607)
                     /note="Surf-1, intron E"
     repeat_region   1383..1565
                     /note="B2 element"
     exon            complement(1608..1796)
                     /note="surfeit 1 protein, exon 5"
     intron          complement(1797..2332)
                     /note="Surf-1, intron D"
     exon            complement(2333..2415)
                     /note="surfeit 1 protein, exon 4"
     intron          complement(2416..2496)
                     /note="Surf-1, intron C"
     exon            complement(2497..2630)
                     /note="surfeit 1 protein, exon 3"
     intron          complement(2630..2852)
                     /note="Surf-1, intron B"
     exon            complement(2853..2904)
                     /note="surfeit 1 protein, exon 2"
     intron          complement(2905..3154)
                     /note="Surf-1, intron A"
     repeat_region   3005..3051
                     /note="direct repeat copy AA"
     repeat_region   3052..3098
                     /note="direct repeat copy BB"
     prim_transcript 3128..6909
                     /note="Surf-2 mRNA and introns"
     exon            complement(3155..3229)
                     /note="surfeit 1 protein, exon 1"
     repeat_region   3239..3248
                     /note="direct repeat copy A"
     repeat_region   3264..3273
                     /note="direct repeat copy B"
     exon            3324..3401
                     /note="surfeit 2 protein, exon 1"
     intron          3402..3602
                     /note="Surf-2, intron A"
     exon            3603..3757
                     /note="surfeit 2 protein, exon 2"
     intron          3758..4324
                     /note="Surf-2, intron B"
     repeat_region   4049..4193
                     /note="B1 element"
     exon            4325..4428
                     /note="surfeit 2 protein, exon 3"
     intron          4429..5776
                     /note="Surf-2, intron C"
     exon            5777..5959
                     /note="surfeit 2 protein, exon 4"
     intron          5960..6095
                     /note="Surf-2, intron D"
     exon            6096..6262
                     /note="surfeit 2 protein, exon 5"
     intron          6263..6630
                     /note="Surf-2, intron E"
     exon            6631..6717
                     /note="surfeit 2 protein, exon 6"
BASE COUNT     1776 a   1716 c   1785 g   1544 t    286 others
ORIGIN      About 262 bp after segment 7.
        1 tctgtcattt cagatccgtc gccactgggg aggcaacgtc ctgggtccta agtctgtggc
       61 tcgaattgcc aagctggaaa aagcaaaggc taaagaactc gccactaaat tgggttaaat
      121 gtacactaaa ttttctgtac ctaaatataa ttacaaaatt atcttgactg cctttggtta
      181 tttgggttgg cgcgagtgtg ccctgtaaaa cggtttcaga ctgagccatg gtttttacat
      241 agcaagatct ttattttatg gttgttggga acaggctcct tacttcacat gatgggtgtc
      301 cgacgtacaa atttttggaa ccacaaatat gatgtggccg cacacagtcc gtacctgtgg
      361 ttgaaagaga ttatagcctg tgcggggtgg ggtgggtgga gggggtagca ggcagcagga
      421 ctaaccaggt aaggatgtac tgcatgtgct cattgcgcag agtcactctc gtctgtcctc
      481 cgatgggccc gccgggggct gtgctgtctg tggagacagg agtcagctct gtcagacccc
      541 tagtgcacag agaagtctgc tggctgtttg gaacnnnnnn nnnnnnnnnn nnnnnnnnnn
      601 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnccccaa gcaggctgac
      781 cacacatacg gaagtctgca tcaatgaaaa tggggtccgc tcctgttatc ttggccatag
      841 cttccaggtc tcgataatac cagtgattcc tttctgggct gttctccgga acaaagggct
      901 tcctgttttc tgtgagcctc actatgccaa ctaggtctac ttctcccaga acctacaaga
      961 gaaggcagga ctgcttttta cgggagtcct gtatggaggc aaggtgactt aactatatac
     1021 atttctcggg ctgaagagcc atatgcctta cctggccttt ctgtctggtc tcaggattca
     1081 ctttcttcct gggaacaaac cctctattaa ccaggatggt gactctagaa caataataaa
     1141 agtcaccctt tgggtgagga agagttccaa tacagatgtc actagtggtt acttcagcat
     1201 ttatctactg ggcatataaa tggtttattc tgtggattca gacttcactg ctaagcctta
     1261 gtgattcaaa ctggctaaga aaatcgctta atgggctgga gagatggctc agcagttagg
     1321 agcactgact gctcttccag agagaggtcc tgagttcaat tcccagcaac cacgtggtgg
     1381 ccccacaacc atctgtagtg gaatccgatg ccctcttctt ggtgtgtctg aagacagcta
     1441 caaagtgtac tcataaataa aatcttttaa aaaannnnnn nnnnnnnnnn nnnnnnnnnn
     1501 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     1561 nnnnnnnnnn gactgagcac aatgaggctc agccatgcac tactcacccc aagtcagagc
     1621 aatggaaagg agtaactaca tgggccccac tttcagttga ggatagtctg ccagcatctc
     1681 gcgcctctcg gacaggatcc accatggtcc gaggcattat gtacaactct ttagagtggt
     1741 caaagtggcc cctgaccttc actggcctgt actccaaatt tttcagttcc attgggctat
     1801 aaggagaaaa ggaagggctg tgagtaagcc aggcttgtca atcctttaag cagaaaggct
     1861 cagtgaggat catcagacag gttatgtcct ccatagcaac ctttgatgcc aaatagtaat
     1921 agaagatggt gataaccaga gcccaggttt tcaaggaaag tgtgggtttc tgtatggcta
     1981 ctgtatatac attaacaaaa tgtgagtcca aactccactt gttcccaacc ctgaaggaca
     2041 aaactttaaa attcaagagg gcaatacaat aaactttaaa gaccaccacc caccagggcc
     2101 acaatgaact gtcaccactt tggaggatcc cttgagacct gaaaagggaa atctgacacc
     2161 gaaaataaat cttttctatt tatgttctgt gtccaatttc ataaaggcag tcagcaggca
     2221 gtttatcggg aaacagggtt ctcgccaaag gacagcaaat tacaccaaac agaagcaagg
     2281 gccccactgc cgccctgtaa gtggaatgaa taaagtgggc agccacactc actctgctgg
     2341 tagagggatg ggctcagcca tgactcgaga ctctaattct gcaataagtt tcagcttcca
     2401 tttccgacgt tggacctata gaaacagcat gagctaagct aaggtatcag tggagtccga
     2461 gcctcggagc cacccaaatt cgacccccat cttcacctgc caagtcccca ggccaaaagc
     2521 agtagcaggg attaaaagca ggaaccactg gagaaaagaa tcgtcctctg ctttagcggc
     2581 ggctgtttca gcagtagaac tgcaacacct gcgtggccta cagaccatcc ctggaggaag
     2641 ttttacaaca ctggcatggg gtcagaagcc ctagaacaaa cagaactgga tggccaattc
     2701 ctcgagaggc ccgattatac cccagagaat gtatggcaaa ggtggcaatt aacagcacat
     2761 gtgcccaacc ctctcccagt acaatacacg cagtaaacca ctgtgggctg aaaagtttgg
     2821 tgacacaccg caccaggacg agcccttctc acctgagcgg acagaaaacc caaagacgct
     2881 cctcctgaca gcgcagaact gggcctgcgg ggaccagaca ggccgagggg ctaaatccag
     2941 ggaaccccac gcctgccaag tgcaagcacc tgacacggtc caacccagcc ccacacaact
     3001 tcgagctaag tcttcttggg cagggtcctc cctggcaccg ctccacccag ggctaagttc
     3061 tcttgggcag ggtcctcctc ggccccactc cactccgggc ccggccctcc agatccctct
     3121 tcttgagcag gatcctctct gaacccccgc ttacccgtcc cgcgtaggcc cattgcgacc
     3181 accgcgtcat ccgtcgcggc agaccagcca aagccatcac agcagccatc tttgagcact
     3241 tccgggaccg agaaatctgc ttccttccgg gacggcgctc tgtctcacgt ggtggctgcg
     3301 gcgtcagatg ggcatctccg atcatggacg aaccaccctc tgatgtgctt gcattcttgc
     3361 gccagcaccc cagcttgaga ctactgccca acacccgcaa ggtcggaaga gaaccgtgac
     3421 cggaagagta gggccaagaa ggaacgggat ctgggaggag cggaacctcg gaggaccgac
     3481 agtggccaga aaggaggggg acccgggagt attgtgccta gaaagcgtgg gacctgggag
     3541 gacagtgcag ggaaggggcg gagctggcaa gaaccacctt ctgaccagat gctcatccgc
     3601 aggttcgctg ctccctaact ggccatgagc tgccctgccg tctgcccgag ctccaggaat
     3661 acacccgcgg caagaagtac caacggctgt caagttcctt ctctaacttc gattacgcag
     3721 ctttcgagcc acacattgtg cccagcacaa agaatcggta cgtagtttgg ccggccagcg
     3781 cctaggagca gagcctgtgc cccctgttct gagcttgtag gagcctccca ctgtgacact
     3841 tatcaggaca gtgcttgtgg cccttgcatg tagttccctg aagccaggct gggcagtgcc
     3901 agtttcactg tttggggatc ctgtctatgg ttgtgtggtt gagtatatcc tcatatagca
     3961 ctccctgagc tgtctgaccg ccaagcatgc agtgtgaaac tgcaatttct tgggtaacaa
     4021 tctgggtcct tgatcccaga tactgagtgg aattgtttgt aagcctgtgt agctgggttg
     4081 gcctggaact cactatgtag gccagactga ccttaaagcc tgcctctgac tctgcccctc
     4141 tgcctctgcc acctgagtgc tggaattaaa gtccaccatg cctttttcaa aatccagaca
     4201 tgagactaaa tgcagagcat cccagcagat ggatcatttg tttgcagcat gcagccatgg
     4261 tggagcccac aagactgggc ccttggtgtg catgcccctc catacccacc tgtttgcctt
     4321 ttaggcacca actgttctgc aaactcaccc tgaggcacat caataagtcc ccagaacacg
     4381 tgctgaggca cacccagggc cggaggtatc agagagcact tcatcaatgt aagtcacccc
     4441 ccagaagctc aggcctttgg atgcatgctc cctgccccca aaaagccaga aaggtgcctt
     4501 ggccatcact gatcctttat tgatgcactt gcctgatctt gcacagtgag aagcaaagcc
     4561 agaagtaggt gggaaggagt gcggtggagg tctgatattc catagcaggg acaatcctag
     4621 ggtagctccc aggctgaggc tgcagtattc tagttatagg acaggtccat agtgcatctg
     4681 atactgcagg agagccttga agagctgttc tcaggagcag gattggtgca aacagtcctg
     4741 caggctccag agggtgaggg gtcttgcagc taaggggttg gtggcttgag caggaaaggg
     4801 gacttgcaga cagcctccag tattgttggg tatagccagg ggtgggaagg catcccaagt
     4861 cattgagagt agagactcct ggggttaggg acagagagag gatggggcca gcttgagcat
     4921 gagggaggtg aaaacctccc atccatgcta ccacaggaag atgagggtaa gaaagggact
     4981 gggactaggt aacttggatg gaggaatgaa gcagggagaa tggagggttg gggttcgcag
     5041 aggctgggga tgagtgggag gcttcaggga ctgcatcgcc tgtcctgccc taacaggcag
     5101 acttcagtca cagggtggga aggtaaggaa tctgtcagtt gtctgctggt gcagagttgg
     5161 cacctgggct gcactgaccg gtggcaagct aaggtttggc tgttgtacct ttgttagctt
     5221 tgtcatctgc tacattttca cagggagggc tgtacaggaa tccccatgat tataaagatg
     5281 tatcaaagag ctgagacaga gccaaaacac agtgaggccg gttagatggc aggctggtta
     5341 ccgtagctaa ttggcctgct gagagcactg ggaggagacc acagtggtcc tagaaataga
     5401 tgtcaaagat gaagcaggga agactggaga tagcagagca gttggttcta tcagatgccc
     5461 agtcccaatt agactgttac tcatgagcaa tgagctggta tccctcacct atggagatgg
     5521 tgctggccca agaagttcag actcctcagt acctgaggac agcatgtgac cctagtaagt
     5581 gccttccctg aagcctcatt cccgcgccaa gttgaagtga agccaggcag tttcccctga
     5641 ggtccaaccc tttggctgcc cacccctatc tggtaggata aaggaacaga accatggcac
     5701 aatacaacac aatacaaccc atagcttagc tgtgcaccaa gagggtacca gtgtaatgag
     5761 cttgctagtc tcctagatga agagtgtcag aaacaaggtg tggaatatgt ccctgcctgc
     5821 cttctacaca agaggaagaa gagagaggac cagacgaaca gtgatgaact cccaggccag
     5881 agaacaggtt tctgggagcc agcttccagt gacgaggaag acgccttgag tgacgacagc
     5941 atgacagacc tatacccacg taagcagacc cagtccaatc cctgcccctg tctcctgcta
     6001 ggtaccttgt gatcctgctg tgagccactt ttcctaagaa agggagggat gcttttacct
     6061 gtgacacagc tcagccaagt gtttgttcct tccagctgag ctgttcacaa agagagaact
     6121 aggcaagcct aagaacgatg acactcctga agactttctg acagaccaac aggatgagaa
     6181 gccggagcat tcagaagaga agagctttag agagagggaa gaggccagag tgggccacaa
     6241 gaggggtcgt aaactgagga aggtgagtgg tgggctgcag agtgctgagg gtagggggct
     6301 actcagcaaa ttctgcaagt gcacttgtcg ggtagcagcc taaaatatgg aagccaaggc
     6361 cactttagaa ggtctgcagg agtaagtcct gactgctcag gaacttcact gctgggtact
     6421 ccctctctat gggacaggac tgagtattct gtcagagagg agcaaggtag acaaagtctc
     6481 tagtcacctg tgttgccctc tcccaggccc cacactgacc cagcaataga agtgttgaga
     6541 aggcagactt aaggctcttc cggcttagac acacagtgcc tttccctgtg aacctgcaga
     6601 ggccctggtc agcattttct gctttcacag aagcagctca cctccttgac caagaagttc
     6661 aagagctatc atcacaagcc caagaacttc agttccttta agcagctggg gagatgaagc
     6721 ctgcaaaaaa agcctttgga tgtccttgga aatggccact gatatcctga cattggctgt
     6781 tgtccagagt gggcctggag gttctgttca gaagatggca gcgtcgggga cttgcatggc
     6841 tttctctgca gtgttgagtg actcttgctg tagctcactg tagcatcccg aagcctagcc
     6901 aggctcccag ctctggctgc tcaccatata ggaccacact atataccaca aatgccatta
     6961 tttattttga tgtttccaaa gatcaaaaca ttttcaaaca caactaagat ataaaataca
     7021 gcataaaaat gaggtttata cctattccca cataaagcta aagcatcttc agaaaacttt
     7081 tgccaaacca atatgtgtat cattgtt
//