GenBank-Updates@genbank.bio.net (09/23/90)
LOCUS DDID11 2387 bp ds-DNA INV 23-SEP-1990 DEFINITION Slime mold (D.discoideum) prestalk D11 gene, complete cds. ACCESSION M11012 KEYWORDS glycoprotein; prestalk D11 protein. SOURCE D.discoideum DNA, clone pD11G14D. ORGANISM Dictyostelium discoideum Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; Dictyosteliidae. REFERENCE 1 (bases 1 to 2387) AUTHORS Barklis,E., Pontius,B. and Lodish,H.F. TITLE Structure of the Dictyostelium discoideum prestalk D11 gene and protein JOURNAL Mol. Cell. Biol. 5, 1473-1479 (1985) STANDARD full staff_review COMMENT Draft entry and clean copy sequence for [1] kindly provided by E.Barklis, 24-SEP-1985. The D11 message is a typical class I prestalk mRNA, which is present at low levels in amoeboid and early developing cells and accumulates to 10- to 20- fold higher levels during later stages of development. The D11 protein has 42 cysteine residues (16.3%). The primary structure of the D11 protein consists of a hydrophobic leader sequence followed by a series of repeated domains. Thirteen of the 25 residues in the leader are leucine, valine or isoleucine. Thirtyfour AAs in the A1 and A2 domains are identical. Substitutions conserve charged and hydrophobic residues. The B elements are highly conserved with the exception of AA five and six. The C domains are identical for the first seven residues in C1-C3. C4 matches the others only at AA four and twelve. A TATA box is located at positions 208-214 and poly-A signals are found at 1118-1123 and 1129-1134. FEATURES from to/span description pept 274 1122 prestalk D11 precursor protein sigp 274 348 prestalk D11 protein leader peptide (putative) matp 349 1122 prestalk D11 protein mRNA 244 1134 D11 mRNA rpt 349 465 D11 repeat A1 rpt 466 519 D11 repeat B1 rpt 535 651 D11 repeat A2 rpt 652 705 D11 repeat B2 rpt 706 750 D11 repeat C1 rpt 751 849 D11 repeat C2 rpt 850 903 D11 repeat B4 rpt 904 951 D11 repeat C3 rpt 952 1005 D11 repeat B5 rpt 1006 1053 D11 repeat C4 rpt 1054 1101 D11 repeat B6 site 2062 2238 putative VECTOR sequence pBR322 BASE COUNT 887 a 306 c 341 g 853 t ORIGIN BglII site. 1 gatctaatta aaaaaattta caaaaaaaaa ataaataaaa aaaaaataaa aaaaatcgtt 61 tttattataa tgggggagtg ttggaagatt ttattttttt ataattatta tttttttatt 121 ttatatgttg gtatatgtat atgtattata taagaatgtt tatatgatat tataattttt 181 tttttttcat ataaaaaaaa atattaatat aaataaaaca tatttttttt gttgtttttt 241 tttacattat taaaatttta gaaaagcaca aaaatgttaa ataaactaat acgattatta 301 attctatcaa gttgtttggt actatcagtt aaaagtgaag ttaatgttga ttgctccctc 361 gttagatgtg cccaaccaat atgtaaacct cattatagat taaacatgac cgattcttgt 421 tgtggtcgtt gtgaaccttg taccgatgtt gcatgtactc ttcaagtcaa atattgtcaa 481 gatggtgaag ttccaaccgg ttgttgtcca tgtactctcc caccaaccaa accagattgc 541 tcccttgtta aatgtgctac accagtatgt aaaccatatt atagattaaa catgaccgat 601 tcttgttgtg gtcgttgtga accatgtaca ggtgtcgcat gtactcttca aatcaaatat 661 tgtcaagatg gtgaagttcc aactggttgt tgtccatgta cactcaacca actaaaaaac 721 cagattgttc aagagttcat gtccaagatt ttaaaatatt gccaagaagg tgaacttcca 781 actggttgtt gtccatgtac actcaaccaa ctaaaaaacc agattgttca agagttcatg 841 tcaaagattt tgaaatactg taaagaaggt gaacttccaa ctggttgttg tccatgtaca 901 ctcaaccaac taaaaaacca gattggtgct gatgtaatgt gtacagtgac attagatagt 961 tgtaaaaatg gtgagcttcc aactggctgc tgtccttgca cacctcaaga aaccaaagtt 1021 cggagatggt ctaaagctat gtgcacaatg catattaaat attgtaaacc aggtgaaaaa 1081 ccctttggtt gttgcccatg tcgtgaaaat ttaactcaat aaaaaattaa taaaatatat 1141 atatttgttt tttaattaat ttaaataata tatcatcttt tttttttcta ataatagatt 1201 tagtttgtcc aataataaaa aatcaaaaat actaataagt ggtgagagtt taaataataa 1261 cacactatta aaagtttata aaaacaaaag gcgaaaatga cccacttttt atttatacac 1321 acaaaaaaat aataaaaaat aataaaaaat aataaaaaaa ttaaaataaa aaaataaaaa 1381 aataaaaaaa taaaaaaata aattaataaa ttaataaatt aaaaaagaat caaaaaattt 1441 aatactcatt tgaaaattaa ggatttcaaa aatgaaaacg ataatactat tattatcaat 1501 acttaatttg tatattttgt tcaatgggca aggtatgttt tttttttttt ttattttttt 1561 ttttttttaa atttaatatc ttttaaaata ttaaatatat ttctcattat catcattatg 1621 cattacaaat acctcaagat gatatttaag acatggcaat ttaaaccaga tagaaattta 1681 gattttggat tggatctttc cagtttagac gaagatatta ccacaaattt ttaaaaatta 1741 ttcagataca atttcaaatc caggtttttg ttgtgtatgt atctattttt taaatgaatt 1801 gaaagaaaat actaattttt tatttattta tctaaatttt agggtgatag tgaagttgaa 1861 tcaactaatg atccaaatgt aattgttgtg catggagaaa attcagttag tcaatggttt 1921 gccgatgctc catatggcgt tgttgatgac tctcaatgtc catcctatag agttgatttt 1981 catttgaata ttttatagaa tgtccagatt gtaattttgg agaattagtc atatacattt 2041 catactctga tattggtttt aataaccaaa cattatttaa ttatgataac gatattgatc 2101 ttcagcatct tttactttca ccagcgtttc tgggtgagca aaaacaggaa ggcaaatgcg 2161 caaaaaaggg aataagggcg acaggaaatg ttgaatactc atactcttcc ttttcaatat 2221 tattgagcat ttatcaggaa tggcatgata atcattagtg atgacaagag tatggttcag 2281 cgaaagtcat atggagaaca ccttattttt ggaaatgatt atcttaatca tactcatact 2341 gctaatattt tcggttatca acatatggat aatgttgacg agaattc //