GenBank-Updates@genbank.bio.net (09/23/90)
LOCUS DDID11 2387 bp ds-DNA INV 23-SEP-1990
DEFINITION Slime mold (D.discoideum) prestalk D11 gene, complete cds.
ACCESSION M11012
KEYWORDS glycoprotein; prestalk D11 protein.
SOURCE D.discoideum DNA, clone pD11G14D.
ORGANISM Dictyostelium discoideum
Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina;
Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida;
Dictyosteliidae.
REFERENCE 1 (bases 1 to 2387)
AUTHORS Barklis,E., Pontius,B. and Lodish,H.F.
TITLE Structure of the Dictyostelium discoideum prestalk D11 gene and
protein
JOURNAL Mol. Cell. Biol. 5, 1473-1479 (1985)
STANDARD full staff_review
COMMENT Draft entry and clean copy sequence for [1] kindly provided by
E.Barklis, 24-SEP-1985.
The D11 message is a typical class I prestalk mRNA, which is
present at low levels in amoeboid and early developing cells and
accumulates to 10- to 20-fold higher levels during later stages of
development. The D11 protein has 42 cysteine residues (16.3%).
The primary structure of the D11 protein consists of a hydrophobic
leader sequence followed by a series of repeated domains. Thirteen
of the 25 residues in the leader are leucine, valine or isoleucine.
Thirty-four AAs in the A1 and A2 domains are identical.
Substitutions conserve charged and hydrophobic residues. The B
elements are highly conserved with the exception of AAs five and
six. The C domains are identical for the first seven residues in
C1-C3. C4 matches the others only at AAs four and twelve.
A TATA box is located at positions 208-214 and poly-A signals are
found at 1118-1123 and 1129-1134.
FEATURES      from   to/span   description
    pept       274      1122   prestalk D11 precursor protein
    sigp       274       348   prestalk D11 protein leader peptide (putative)
    matp       349      1122   prestalk D11 protein
    mRNA       244      1134   D11 mRNA
    rpt        349       465   D11 repeat A1
    rpt        466       519   D11 repeat B1
    rpt        535       651   D11 repeat A2
    rpt        652       705   D11 repeat B2
    rpt        706       750   D11 repeat C1
    rpt        751       849   D11 repeat C2
    rpt        850       903   D11 repeat B4
    rpt        904       951   D11 repeat C3
    rpt        952      1005   D11 repeat B5
    rpt       1006      1053   D11 repeat C4
    rpt       1054      1101   D11 repeat B6
    site      2062      2238   putative VECTOR sequence pBR322
BASE COUNT 887 a 306 c 341 g 853 t
ORIGIN BglII site.
1 gatctaatta aaaaaattta caaaaaaaaa ataaataaaa aaaaaataaa aaaaatcgtt
61 tttattataa tgggggagtg ttggaagatt ttattttttt ataattatta tttttttatt
121 ttatatgttg gtatatgtat atgtattata taagaatgtt tatatgatat tataattttt
181 tttttttcat ataaaaaaaa atattaatat aaataaaaca tatttttttt gttgtttttt
241 tttacattat taaaatttta gaaaagcaca aaaatgttaa ataaactaat acgattatta
301 attctatcaa gttgtttggt actatcagtt aaaagtgaag ttaatgttga ttgctccctc
361 gttagatgtg cccaaccaat atgtaaacct cattatagat taaacatgac cgattcttgt
421 tgtggtcgtt gtgaaccttg taccgatgtt gcatgtactc ttcaagtcaa atattgtcaa
481 gatggtgaag ttccaaccgg ttgttgtcca tgtactctcc caccaaccaa accagattgc
541 tcccttgtta aatgtgctac accagtatgt aaaccatatt atagattaaa catgaccgat
601 tcttgttgtg gtcgttgtga accatgtaca ggtgtcgcat gtactcttca aatcaaatat
661 tgtcaagatg gtgaagttcc aactggttgt tgtccatgta cactcaacca actaaaaaac
721 cagattgttc aagagttcat gtccaagatt ttaaaatatt gccaagaagg tgaacttcca
781 actggttgtt gtccatgtac actcaaccaa ctaaaaaacc agattgttca agagttcatg
841 tcaaagattt tgaaatactg taaagaaggt gaacttccaa ctggttgttg tccatgtaca
901 ctcaaccaac taaaaaacca gattggtgct gatgtaatgt gtacagtgac attagatagt
961 tgtaaaaatg gtgagcttcc aactggctgc tgtccttgca cacctcaaga aaccaaagtt
1021 cggagatggt ctaaagctat gtgcacaatg catattaaat attgtaaacc aggtgaaaaa
1081 ccctttggtt gttgcccatg tcgtgaaaat ttaactcaat aaaaaattaa taaaatatat
1141 atatttgttt tttaattaat ttaaataata tatcatcttt tttttttcta ataatagatt
1201 tagtttgtcc aataataaaa aatcaaaaat actaataagt ggtgagagtt taaataataa
1261 cacactatta aaagtttata aaaacaaaag gcgaaaatga cccacttttt atttatacac
1321 acaaaaaaat aataaaaaat aataaaaaat aataaaaaaa ttaaaataaa aaaataaaaa
1381 aataaaaaaa taaaaaaata aattaataaa ttaataaatt aaaaaagaat caaaaaattt
1441 aatactcatt tgaaaattaa ggatttcaaa aatgaaaacg ataatactat tattatcaat
1501 acttaatttg tatattttgt tcaatgggca aggtatgttt tttttttttt ttattttttt
1561 ttttttttaa atttaatatc ttttaaaata ttaaatatat ttctcattat catcattatg
1621 cattacaaat acctcaagat gatatttaag acatggcaat ttaaaccaga tagaaattta
1681 gattttggat tggatctttc cagtttagac gaagatatta ccacaaattt ttaaaaatta
1741 ttcagataca atttcaaatc caggtttttg ttgtgtatgt atctattttt taaatgaatt
1801 gaaagaaaat actaattttt tatttattta tctaaatttt agggtgatag tgaagttgaa
1861 tcaactaatg atccaaatgt aattgttgtg catggagaaa attcagttag tcaatggttt
1921 gccgatgctc catatggcgt tgttgatgac tctcaatgtc catcctatag agttgatttt
1981 catttgaata ttttatagaa tgtccagatt gtaattttgg agaattagtc atatacattt
2041 catactctga tattggtttt aataaccaaa cattatttaa ttatgataac gatattgatc
2101 ttcagcatct tttactttca ccagcgtttc tgggtgagca aaaacaggaa ggcaaatgcg
2161 caaaaaaggg aataagggcg acaggaaatg ttgaatactc atactcttcc ttttcaatat
2221 tattgagcat ttatcaggaa tggcatgata atcattagtg atgacaagag tatggttcag
2281 cgaaagtcat atggagaaca ccttattttt ggaaatgatt atcttaatca tactcatact
2341 gctaatattt tcggttatca acatatggat aatgttgacg agaattc
//
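
The figures quoted in the entry above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the GenBank record: the numbers are copied by hand from the LOCUS, COMMENT, FEATURES and BASE COUNT lines, and the names (`span_length`, `codon_count`, and so on) are hypothetical helpers introduced only for illustration. It assumes that the pept and matp spans include the terminating taa codon at 1120-1122, which is what the sequence shows.

```python
# Values copied by hand from the entry above (not parsed from the file).
LOCUS_LENGTH = 2387
BASE_COUNT = {"a": 887, "c": 306, "g": 341, "t": 853}
PEPT = (274, 1122)   # precursor protein span
SIGP = (274, 348)    # putative leader peptide span
MATP = (349, 1122)   # mature protein span
CYSTEINES = 42       # from the COMMENT block


def span_length(start, end):
    """Length in bases of an inclusive 1-based span."""
    return end - start + 1


def codon_count(start, end):
    """Number of whole codons in a span; fails if the span is out of frame."""
    length = span_length(start, end)
    if length % 3 != 0:
        raise ValueError(f"span {start}-{end} is not a multiple of 3")
    return length // 3


total_bases = sum(BASE_COUNT.values())   # should equal the LOCUS length
leader_residues = codon_count(*SIGP)     # COMMENT says 25 leader residues
pept_codons = codon_count(*PEPT)         # includes the stop codon (assumed)
# Assumption: the matp span ends with the stop codon, so subtract one.
mature_residues = codon_count(*MATP) - 1
cys_percent = round(100 * CYSTEINES / mature_residues, 1)

print(total_bases, leader_residues, pept_codons, mature_residues, cys_percent)
# prints: 2387 25 283 257 16.3
```

Under these assumptions the base count sums to the stated sequence length, the leader comes out at the 25 residues mentioned in the COMMENT, and 42 cysteines in 257 mature residues gives the quoted 16.3%.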