[bionet.molbio.genbank.updates] Mouse surfeit locus surfeit 3 gene, exon 8, and surfeit 1 and 2

GenBank-Updates@genbank.bio.net (05/10/91)

LOCUS       MUSSURF38    7107 bp ds-DNA             ROD       10-MAY-1991
DEFINITION  Mouse surfeit locus surfeit 3 gene, exon 8, and surfeit 1 and 2
            genes, complete cds.
ACCESSION   M14689 M14690 M14691
KEYWORDS    B1 repetitive sequence; B2 repetitive sequence; surfeit locus;
            surfeit protein.
SEGMENT     8 of 8
SOURCE      Mouse (strain BALB/c) cell line 3T3 DNA, clones IDE and H1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 229 to 7107)
  AUTHORS   Williams,T.J. and Fried,M.
  TITLE     The MES-1 murine enhancer element is closely associated with the
            heterogeneous 5' ends of two divergent transcription units
  JOURNAL   Mol. Cell. Biol. 6, 4558-4569 (1986)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 7107)
  AUTHORS   Fried,M.
  JOURNAL   Unpublished (1988) ICRF, P.O.Box 123, London, WC2A 3PX.
  STANDARD  full staff_review
REFERENCE   3  (bases 1 to 286)
  AUTHORS   Huxley,C., Williams,T. and Fried,M.
  TITLE     One of the tightly clustered genes of the mouse surfeit locus is a
            highly expressed member of a multigene family whose other members
            are predominantly processed pseudogenes
  JOURNAL   Mol. Cell. Biol. 8, 3898-3905 (1988)
  STANDARD  full staff_entry
COMMENT
            [Mol. Cell. Biol. 6, 4558-4569 (1986)]  exons and partial introns
            only.
            
            Draft entry and computer readable sequence for [Unpublished (1988)
            ICRF, P.O.Box 123, London, WC2A 3PX.] kindly provided
            by M.Fried, 19-SEP-1988.
FEATURES             Location/Qualifiers
     prim_transcript <1..160
                     /note="Surf-3 mRNA and introns"
     intron          <1..13
                     /note="Surf-3 intron G"
     CDS             join(M21455:25..27,M21456:14..134,M21457:14..163,
                     M21458:14..154,M21459:14..93,M21460:14..144,M14692:14..83,
                     14..118)
                     /note="surfeit 3 protein"
                     /codon_start=25
     CDS             join(complement(3155..3229),complement(2853..2904),
                     complement(2497..2630),complement(2333..2415),
                     complement(1608..1796),complement(1052..1124),
                     complement(790..952),complement(426..507),
                     complement(285..354))
                     /note="surfeit 1 protein"
                     /codon_start=3229
     CDS             join(3324..3401,3603..3757,4325..4428,5777..5959,
                     6096..6262,6631..6717)
                     /note="surfeit 2 protein"
                     /codon_start=3324
     exon            14..118
                     /note="surfeit 3 protein, exon 8"
     prim_transcript complement(237..3233)
                     /note="Surf-1 mRNA and introns"
     exon            complement(285..354)
                     /note="surfeit 1 protein, exon 9"
     intron          complement(355..425)
                     /note="Surf-1, intron H"
     exon            complement(426..507)
                     /note="surfeit 1 protein, exon 8"
     intron          complement(508..789)
                     /note="Surf-1, intron G"
     exon            complement(790..952)
                     /note="surfeit 1 protein, exon 7"
     intron          complement(953..1051)
                     /note="Surf-1, intron F"
     exon            complement(1052..1124)
                     /note="surfeit 1 protein, exon 6"
     intron          complement(1125..1607)
                     /note="Surf-1, intron E"
     repeat_region   1383..1565
                     /note="B2 element"
     exon            complement(1608..1796)
                     /note="surfeit 1 protein, exon 5"
     intron          complement(1797..2332)
                     /note="Surf-1, intron D"
     exon            complement(2333..2415)
                     /note="surfeit 1 protein, exon 4"
     intron          complement(2416..2496)
                     /note="Surf-1, intron C"
     exon            complement(2497..2630)
                     /note="surfeit 1 protein, exon 3"
     intron          complement(2631..2852)
                     /note="Surf-1, intron B"
     exon            complement(2853..2904)
                     /note="surfeit 1 protein, exon 2"
     intron          complement(2905..3154)
                     /note="Surf-1, intron A"
     repeat_region   3005..3051
                     /note="direct repeat copy AA"
     repeat_region   3052..3098
                     /note="direct repeat copy BB"
     prim_transcript 3128..6909
                     /note="Surf-2 mRNA and introns"
     exon            complement(3155..3229)
                     /note="surfeit 1 protein, exon 1"
     repeat_region   3239..3248
                     /note="direct repeat copy A"
     repeat_region   3264..3273
                     /note="direct repeat copy B"
     exon            3324..3401
                     /note="surfeit 2 protein, exon 1"
     intron          3402..3602
                     /note="Surf-2, intron A"
     exon            3603..3757
                     /note="surfeit 2 protein, exon 2"
     intron          3758..4324
                     /note="Surf-2, intron B"
     repeat_region   4049..4193
                     /note="B1 element"
     exon            4325..4428
                     /note="surfeit 2 protein, exon 3"
     intron          4429..5776
                     /note="Surf-2, intron C"
     exon            5777..5959
                     /note="surfeit 2 protein, exon 4"
     intron          5960..6095
                     /note="Surf-2, intron D"
     exon            6096..6262
                     /note="surfeit 2 protein, exon 5"
     intron          6263..6630
                     /note="Surf-2, intron E"
     exon            6631..6717
                     /note="surfeit 2 protein, exon 6"
BASE COUNT     1776 a   1716 c   1785 g   1544 t    286 others
ORIGIN      About 262 bp after segment 7.
        1 tctgtcattt cagatccgtc gccactgggg aggcaacgtc ctgggtccta agtctgtggc
       61 tcgaattgcc aagctggaaa aagcaaaggc taaagaactc gccactaaat tgggttaaat
      121 gtacactaaa ttttctgtac ctaaatataa ttacaaaatt atcttgactg cctttggtta
      181 tttgggttgg cgcgagtgtg ccctgtaaaa cggtttcaga ctgagccatg gtttttacat
      241 agcaagatct ttattttatg gttgttggga acaggctcct tacttcacat gatgggtgtc
      301 cgacgtacaa atttttggaa ccacaaatat gatgtggccg cacacagtcc gtacctgtgg
      361 ttgaaagaga ttatagcctg tgcggggtgg ggtgggtgga gggggtagca ggcagcagga
      421 ctaaccaggt aaggatgtac tgcatgtgct cattgcgcag agtcactctc gtctgtcctc
      481 cgatgggccc gccgggggct gtgctgtctg tggagacagg agtcagctct gtcagacccc
      541 tagtgcacag agaagtctgc tggctgtttg gaacnnnnnn nnnnnnnnnn nnnnnnnnnn
      601 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnccccaa gcaggctgac
      781 cacacatacg gaagtctgca tcaatgaaaa tggggtccgc tcctgttatc ttggccatag
      841 cttccaggtc tcgataatac cagtgattcc tttctgggct gttctccgga acaaagggct
      901 tcctgttttc tgtgagcctc actatgccaa ctaggtctac ttctcccaga acctacaaga
      961 gaaggcagga ctgcttttta cgggagtcct gtatggaggc aaggtgactt aactatatac
     1021 atttctcggg ctgaagagcc atatgcctta cctggccttt ctgtctggtc tcaggattca
     1081 ctttcttcct gggaacaaac cctctattaa ccaggatggt gactctagaa caataataaa
     1141 agtcaccctt tgggtgagga agagttccaa tacagatgtc actagtggtt acttcagcat
     1201 ttatctactg ggcatataaa tggtttattc tgtggattca gacttcactg ctaagcctta
     1261 gtgattcaaa ctggctaaga aaatcgctta atgggctgga gagatggctc agcagttagg
     1321 agcactgact gctcttccag agagaggtcc tgagttcaat tcccagcaac cacgtggtgg
     1381 ccccacaacc atctgtagtg gaatccgatg ccctcttctt ggtgtgtctg aagacagcta
     1441 caaagtgtac tcataaataa aatcttttaa aaaannnnnn nnnnnnnnnn nnnnnnnnnn
     1501 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     1561 nnnnnnnnnn gactgagcac aatgaggctc agccatgcac tactcacccc aagtcagagc
     1621 aatggaaagg agtaactaca tgggccccac tttcagttga ggatagtctg ccagcatctc
     1681 gcgcctctcg gacaggatcc accatggtcc gaggcattat gtacaactct ttagagtggt
     1741 caaagtggcc cctgaccttc actggcctgt actccaaatt tttcagttcc attgggctat
     1801 aaggagaaaa ggaagggctg tgagtaagcc aggcttgtca atcctttaag cagaaaggct
     1861 cagtgaggat catcagacag gttatgtcct ccatagcaac ctttgatgcc aaatagtaat
     1921 agaagatggt gataaccaga gcccaggttt tcaaggaaag tgtgggtttc tgtatggcta
     1981 ctgtatatac attaacaaaa tgtgagtcca aactccactt gttcccaacc ctgaaggaca
     2041 aaactttaaa attcaagagg gcaatacaat aaactttaaa gaccaccacc caccagggcc
     2101 acaatgaact gtcaccactt tggaggatcc cttgagacct gaaaagggaa atctgacacc
     2161 gaaaataaat cttttctatt tatgttctgt gtccaatttc ataaaggcag tcagcaggca
     2221 gtttatcggg aaacagggtt ctcgccaaag gacagcaaat tacaccaaac agaagcaagg
     2281 gccccactgc cgccctgtaa gtggaatgaa taaagtgggc agccacactc actctgctgg
     2341 tagagggatg ggctcagcca tgactcgaga ctctaattct gcaataagtt tcagcttcca
     2401 tttccgacgt tggacctata gaaacagcat gagctaagct aaggtatcag tggagtccga
     2461 gcctcggagc cacccaaatt cgacccccat cttcacctgc caagtcccca ggccaaaagc
     2521 agtagcaggg attaaaagca ggaaccactg gagaaaagaa tcgtcctctg ctttagcggc
     2581 ggctgtttca gcagtagaac tgcaacacct gcgtggccta cagaccatcc ctggaggaag
     2641 ttttacaaca ctggcatggg gtcagaagcc ctagaacaaa cagaactgga tggccaattc
     2701 ctcgagaggc ccgattatac cccagagaat gtatggcaaa ggtggcaatt aacagcacat
     2761 gtgcccaacc ctctcccagt acaatacacg cagtaaacca ctgtgggctg aaaagtttgg
     2821 tgacacaccg caccaggacg agcccttctc acctgagcgg acagaaaacc caaagacgct
     2881 cctcctgaca gcgcagaact gggcctgcgg ggaccagaca ggccgagggg ctaaatccag
     2941 ggaaccccac gcctgccaag tgcaagcacc tgacacggtc caacccagcc ccacacaact
     3001 tcgagctaag tcttcttggg cagggtcctc cctggcaccg ctccacccag ggctaagttc
     3061 tcttgggcag ggtcctcctc ggccccactc cactccgggc ccggccctcc agatccctct
     3121 tcttgagcag gatcctctct gaacccccgc ttacccgtcc cgcgtaggcc cattgcgacc
     3181 accgcgtcat ccgtcgcggc agaccagcca aagccatcac agcagccatc tttgagcact
     3241 tccgggaccg agaaatctgc ttccttccgg gacggcgctc tgtctcacgt ggtggctgcg
     3301 gcgtcagatg ggcatctccg atcatggacg aaccaccctc tgatgtgctt gcattcttgc
     3361 gccagcaccc cagcttgaga ctactgccca acacccgcaa ggtcggaaga gaaccgtgac
     3421 cggaagagta gggccaagaa ggaacgggat ctgggaggag cggaacctcg gaggaccgac
     3481 agtggccaga aaggaggggg acccgggagt attgtgccta gaaagcgtgg gacctgggag
     3541 gacagtgcag ggaaggggcg gagctggcaa gaaccacctt ctgaccagat gctcatccgc
     3601 aggttcgctg ctccctaact ggccatgagc tgccctgccg tctgcccgag ctccaggaat
     3661 acacccgcgg caagaagtac caacggctgt caagttcctt ctctaacttc gattacgcag
     3721 ctttcgagcc acacattgtg cccagcacaa agaatcggta cgtagtttgg ccggccagcg
     3781 cctaggagca gagcctgtgc cccctgttct gagcttgtag gagcctccca ctgtgacact
     3841 tatcaggaca gtgcttgtgg cccttgcatg tagttccctg aagccaggct gggcagtgcc
     3901 agtttcactg tttggggatc ctgtctatgg ttgtgtggtt gagtatatcc tcatatagca
     3961 ctccctgagc tgtctgaccg ccaagcatgc agtgtgaaac tgcaatttct tgggtaacaa
     4021 tctgggtcct tgatcccaga tactgagtgg aattgtttgt aagcctgtgt agctgggttg
     4081 gcctggaact cactatgtag gccagactga ccttaaagcc tgcctctgac tctgcccctc
     4141 tgcctctgcc acctgagtgc tggaattaaa gtccaccatg cctttttcaa aatccagaca
     4201 tgagactaaa tgcagagcat cccagcagat ggatcatttg tttgcagcat gcagccatgg
     4261 tggagcccac aagactgggc ccttggtgtg catgcccctc catacccacc tgtttgcctt
     4321 ttaggcacca actgttctgc aaactcaccc tgaggcacat caataagtcc ccagaacacg
     4381 tgctgaggca cacccagggc cggaggtatc agagagcact tcatcaatgt aagtcacccc
     4441 ccagaagctc aggcctttgg atgcatgctc cctgccccca aaaagccaga aaggtgcctt
     4501 ggccatcact gatcctttat tgatgcactt gcctgatctt gcacagtgag aagcaaagcc
     4561 agaagtaggt gggaaggagt gcggtggagg tctgatattc catagcaggg acaatcctag
     4621 ggtagctccc aggctgaggc tgcagtattc tagttatagg acaggtccat agtgcatctg
     4681 atactgcagg agagccttga agagctgttc tcaggagcag gattggtgca aacagtcctg
     4741 caggctccag agggtgaggg gtcttgcagc taaggggttg gtggcttgag caggaaaggg
     4801 gacttgcaga cagcctccag tattgttggg tatagccagg ggtgggaagg catcccaagt
     4861 cattgagagt agagactcct ggggttaggg acagagagag gatggggcca gcttgagcat
     4921 gagggaggtg aaaacctccc atccatgcta ccacaggaag atgagggtaa gaaagggact
     4981 gggactaggt aacttggatg gaggaatgaa gcagggagaa tggagggttg gggttcgcag
     5041 aggctgggga tgagtgggag gcttcaggga ctgcatcgcc tgtcctgccc taacaggcag
     5101 acttcagtca cagggtggga aggtaaggaa tctgtcagtt gtctgctggt gcagagttgg
     5161 cacctgggct gcactgaccg gtggcaagct aaggtttggc tgttgtacct ttgttagctt
     5221 tgtcatctgc tacattttca cagggagggc tgtacaggaa tccccatgat tataaagatg
     5281 tatcaaagag ctgagacaga gccaaaacac agtgaggccg gttagatggc aggctggtta
     5341 ccgtagctaa ttggcctgct gagagcactg ggaggagacc acagtggtcc tagaaataga
     5401 tgtcaaagat gaagcaggga agactggaga tagcagagca gttggttcta tcagatgccc
     5461 agtcccaatt agactgttac tcatgagcaa tgagctggta tccctcacct atggagatgg
     5521 tgctggccca agaagttcag actcctcagt acctgaggac agcatgtgac cctagtaagt
     5581 gccttccctg aagcctcatt cccgcgccaa gttgaagtga agccaggcag tttcccctga
     5641 ggtccaaccc tttggctgcc cacccctatc tggtaggata aaggaacaga accatggcac
     5701 aatacaacac aatacaaccc atagcttagc tgtgcaccaa gagggtacca gtgtaatgag
     5761 cttgctagtc tcctagatga agagtgtcag aaacaaggtg tggaatatgt ccctgcctgc
     5821 cttctacaca agaggaagaa gagagaggac cagacgaaca gtgatgaact cccaggccag
     5881 agaacaggtt tctgggagcc agcttccagt gacgaggaag acgccttgag tgacgacagc
     5941 atgacagacc tatacccacg taagcagacc cagtccaatc cctgcccctg tctcctgcta
     6001 ggtaccttgt gatcctgctg tgagccactt ttcctaagaa agggagggat gcttttacct
     6061 gtgacacagc tcagccaagt gtttgttcct tccagctgag ctgttcacaa agagagaact
     6121 aggcaagcct aagaacgatg acactcctga agactttctg acagaccaac aggatgagaa
     6181 gccggagcat tcagaagaga agagctttag agagagggaa gaggccagag tgggccacaa
     6241 gaggggtcgt aaactgagga aggtgagtgg tgggctgcag agtgctgagg gtagggggct
     6301 actcagcaaa ttctgcaagt gcacttgtcg ggtagcagcc taaaatatgg aagccaaggc
     6361 cactttagaa ggtctgcagg agtaagtcct gactgctcag gaacttcact gctgggtact
     6421 ccctctctat gggacaggac tgagtattct gtcagagagg agcaaggtag acaaagtctc
     6481 tagtcacctg tgttgccctc tcccaggccc cacactgacc cagcaataga agtgttgaga
     6541 aggcagactt aaggctcttc cggcttagac acacagtgcc tttccctgtg aacctgcaga
     6601 ggccctggtc agcattttct gctttcacag aagcagctca cctccttgac caagaagttc
     6661 aagagctatc atcacaagcc caagaacttc agttccttta agcagctggg gagatgaagc
     6721 ctgcaaaaaa agcctttgga tgtccttgga aatggccact gatatcctga cattggctgt
     6781 tgtccagagt gggcctggag gttctgttca gaagatggca gcgtcgggga cttgcatggc
     6841 tttctctgca gtgttgagtg actcttgctg tagctcactg tagcatcccg aagcctagcc
     6901 aggctcccag ctctggctgc tcaccatata ggaccacact atataccaca aatgccatta
     6961 tttattttga tgtttccaaa gatcaaaaca ttttcaaaca caactaagat ataaaataca
     7021 gcataaaaat gaggtttata cctattccca cataaagcta aagcatcttc agaaaacttt
     7081 tgccaaacca atatgtgtat cattgtt
//