[bionet.molbio.genbank.updates] ACCESSION M11012

GenBank-Updates@genbank.bio.net (09/23/90)

LOCUS       DDID11       2387 bp ds-DNA             INV       23-SEP-1990
DEFINITION  Slime mold (D.discoideum) prestalk D11 gene, complete cds.
ACCESSION   M11012
KEYWORDS    glycoprotein; prestalk D11 protein.
SOURCE      D.discoideum DNA, clone pD11G14D.
  ORGANISM  Dictyostelium discoideum
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; 
            Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; 
            Dictyosteliidae.
REFERENCE   1  (bases 1 to 2387)
  AUTHORS   Barklis,E., Pontius,B. and Lodish,H.F.
  TITLE     Structure of the Dictyostelium discoideum prestalk D11 gene and
            protein
  JOURNAL   Mol. Cell. Biol. 5, 1473-1479 (1985)
  STANDARD  full staff_review
COMMENT     Draft entry and clean copy sequence for [1] kindly provided by
            E.Barklis, 24-SEP-1985.
            
            The D11 message is a typical class I prestalk mRNA, which is
            present at low levels in amoeboid and early developing cells and
            accumulates to 10- to 20-fold higher levels during later stages of
            development.  The D11 protein has 42 cysteine residues (16.3%).
            
            The primary structure of the D11 protein consists of a hydrophobic
            leader sequence followed by a series of repeated domains.  Thirteen
            of the 25 residues in the leader are leucine, valine or isoleucine.
            Thirty-four AAs in the A1 and A2 domains are identical.
            Substitutions conserve charged and hydrophobic residues.  The B
            elements are highly conserved with the exception of AAs five and
            six.  The C domains are identical for the first seven residues in
            C1-C3.  C4 matches the others only at AAs four and twelve.
            
            A TATA box is located at positions 208-214 and poly-A signals are
            found at 1118-1123 and 1129-1134.
FEATURES       from  to/span     description
    pept        274     1122     prestalk D11 precursor protein
    sigp        274      348     prestalk D11 protein leader peptide (putative)
    matp        349     1122     prestalk D11 protein
    mRNA        244     1134     D11 mRNA
    rpt         349      465     D11 repeat A1
    rpt         466      519     D11 repeat B1
    rpt         535      651     D11 repeat A2
    rpt         652      705     D11 repeat B2
    rpt         706      750     D11 repeat C1
    rpt         751      849     D11 repeat C2
    rpt         850      903     D11 repeat B4
    rpt         904      951     D11 repeat C3
    rpt         952     1005     D11 repeat B5
    rpt        1006     1053     D11 repeat C4
    rpt        1054     1101     D11 repeat B6
    site       2062     2238     putative VECTOR sequence pBR322
BASE COUNT      887 a    306 c    341 g    853 t
ORIGIN      BglII site.
        1 gatctaatta aaaaaattta caaaaaaaaa ataaataaaa aaaaaataaa aaaaatcgtt
       61 tttattataa tgggggagtg ttggaagatt ttattttttt ataattatta tttttttatt
      121 ttatatgttg gtatatgtat atgtattata taagaatgtt tatatgatat tataattttt
      181 tttttttcat ataaaaaaaa atattaatat aaataaaaca tatttttttt gttgtttttt
      241 tttacattat taaaatttta gaaaagcaca aaaatgttaa ataaactaat acgattatta
      301 attctatcaa gttgtttggt actatcagtt aaaagtgaag ttaatgttga ttgctccctc
      361 gttagatgtg cccaaccaat atgtaaacct cattatagat taaacatgac cgattcttgt
      421 tgtggtcgtt gtgaaccttg taccgatgtt gcatgtactc ttcaagtcaa atattgtcaa
      481 gatggtgaag ttccaaccgg ttgttgtcca tgtactctcc caccaaccaa accagattgc
      541 tcccttgtta aatgtgctac accagtatgt aaaccatatt atagattaaa catgaccgat
      601 tcttgttgtg gtcgttgtga accatgtaca ggtgtcgcat gtactcttca aatcaaatat
      661 tgtcaagatg gtgaagttcc aactggttgt tgtccatgta cactcaacca actaaaaaac
      721 cagattgttc aagagttcat gtccaagatt ttaaaatatt gccaagaagg tgaacttcca
      781 actggttgtt gtccatgtac actcaaccaa ctaaaaaacc agattgttca agagttcatg
      841 tcaaagattt tgaaatactg taaagaaggt gaacttccaa ctggttgttg tccatgtaca
      901 ctcaaccaac taaaaaacca gattggtgct gatgtaatgt gtacagtgac attagatagt
      961 tgtaaaaatg gtgagcttcc aactggctgc tgtccttgca cacctcaaga aaccaaagtt
     1021 cggagatggt ctaaagctat gtgcacaatg catattaaat attgtaaacc aggtgaaaaa
     1081 ccctttggtt gttgcccatg tcgtgaaaat ttaactcaat aaaaaattaa taaaatatat
     1141 atatttgttt tttaattaat ttaaataata tatcatcttt tttttttcta ataatagatt
     1201 tagtttgtcc aataataaaa aatcaaaaat actaataagt ggtgagagtt taaataataa
     1261 cacactatta aaagtttata aaaacaaaag gcgaaaatga cccacttttt atttatacac
     1321 acaaaaaaat aataaaaaat aataaaaaat aataaaaaaa ttaaaataaa aaaataaaaa
     1381 aataaaaaaa taaaaaaata aattaataaa ttaataaatt aaaaaagaat caaaaaattt
     1441 aatactcatt tgaaaattaa ggatttcaaa aatgaaaacg ataatactat tattatcaat
     1501 acttaatttg tatattttgt tcaatgggca aggtatgttt tttttttttt ttattttttt
     1561 ttttttttaa atttaatatc ttttaaaata ttaaatatat ttctcattat catcattatg
     1621 cattacaaat acctcaagat gatatttaag acatggcaat ttaaaccaga tagaaattta
     1681 gattttggat tggatctttc cagtttagac gaagatatta ccacaaattt ttaaaaatta
     1741 ttcagataca atttcaaatc caggtttttg ttgtgtatgt atctattttt taaatgaatt
     1801 gaaagaaaat actaattttt tatttattta tctaaatttt agggtgatag tgaagttgaa
     1861 tcaactaatg atccaaatgt aattgttgtg catggagaaa attcagttag tcaatggttt
     1921 gccgatgctc catatggcgt tgttgatgac tctcaatgtc catcctatag agttgatttt
     1981 catttgaata ttttatagaa tgtccagatt gtaattttgg agaattagtc atatacattt
     2041 catactctga tattggtttt aataaccaaa cattatttaa ttatgataac gatattgatc
     2101 ttcagcatct tttactttca ccagcgtttc tgggtgagca aaaacaggaa ggcaaatgcg
     2161 caaaaaaggg aataagggcg acaggaaatg ttgaatactc atactcttcc ttttcaatat
     2221 tattgagcat ttatcaggaa tggcatgata atcattagtg atgacaagag tatggttcag
     2281 cgaaagtcat atggagaaca ccttattttt ggaaatgatt atcttaatca tactcatact
     2341 gctaatattt tcggttatca acatatggat aatgttgacg agaattc
//
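
The coordinates quoted in the COMMENT and FEATURES blocks can be cross-checked
with a little arithmetic. The sketch below is not part of the GenBank entry;
the helper names (span_length, slice_1based) are hypothetical, and it assumes
that the 16.3% cysteine figure refers to the mature protein (matp, with the
stop codon excluded) rather than to the precursor. The two sequence lines are
copied from ORIGIN at positions 181 and 1081.

```python
def span_length(start, end):
    """Length in bases of a 1-based, inclusive feature span."""
    return end - start + 1


def slice_1based(line_start, line_seq, start, end):
    """Return bases start..end (1-based, inclusive) from a sequence line
    whose first base sits at position line_start."""
    offset = start - line_start
    return line_seq[offset:offset + span_length(start, end)]


# Spans copied from the FEATURES table.
leader_codons = span_length(274, 348) // 3          # sigp
mature_residues = span_length(349, 1122) // 3 - 1   # matp, minus the stop codon

# Assumption: the cysteine percentage is relative to the mature protein.
cys_percent = round(100 * 42 / mature_residues, 1)

# ORIGIN lines beginning at 181 and 1081, with the column spaces removed.
line_181 = ("tttttttcat ataaaaaaaa atattaatat "
            "aaataaaaca tatttttttt gttgtttttt").replace(" ", "")
line_1081 = ("ccctttggtt gttgcccatg tcgtgaaaat "
             "ttaactcaat aaaaaattaa taaaatatat").replace(" ", "")

tata_box = slice_1based(181, line_181, 208, 214)
polya_1 = slice_1based(1081, line_1081, 1118, 1123)
polya_2 = slice_1based(1081, line_1081, 1129, 1134)

# BASE COUNT line: a + c + g + t should equal the LOCUS length.
base_total = 887 + 306 + 341 + 853

print(leader_codons, mature_residues, cys_percent)
print(tata_box, polya_1, polya_2, base_total)
```

Under that assumption the leader spans 25 codons, matching the "25 residues in
the leader" in the COMMENT, and the base counts sum to the 2387 bp given in
LOCUS.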