[bionet.molbio.genbank.updates] Escherichia coli DNA for genes involved in synthesis of CS3 pili

GenBank-Updates@genbank.bio.net (05/27/91)

LOCUS       ECOCS3P      4746 bp ds-DNA             BCT       27-MAY-1991
DEFINITION  Escherichia coli DNA for genes involved in synthesis of CS3 pili
ACCESSION   X16944
KEYWORDS    adhesion factor; fimbrial biosynthesis; pilin.
SOURCE      Escherichia coli DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 4746)
  AUTHORS   Manning,P.A.
  JOURNAL   Unpublished (1989)
  STANDARD  full automatic
REFERENCE   2  (bases 1 to 4746)
  AUTHORS   Jalajakumari,M.B., Thomas,C.J., Halter,R. and Manning,P.A.
  TITLE     Genes for biosynthesis and assembly of CS3 pili of CFA/II
            enterotoxigenic Escherichia coli: novel regulation of pilus
            production by bypassing an amber codon
  JOURNAL   Mol. Microbiol. 3, 1685-1695 (1989)
  STANDARD  full automatic
COMMENT     SWISS-PROT; P15483; CS31$ECOLI. SWISS-PROT; P15484; CS32$ECOLI.
            SWISS-PROT; P15485; CS33$ECOLI. SWISS-PROT; P15486; CS34$ECOLI.
            SWISS-PROT; P15487; CS35$ECOLI. SWISS-PROT; P15488; FMC3$ECOLI.
            
            *source: strain=PB176; clone=pPM484; Amber stop codon at 3577-3579
            can be read through, resulting in a 104kD protein.
            
            From EMBL    entry ECCS3P;  dated 13-FEB-1990.
FEATURES             Location/Qualifiers
     CDS             378..1100
                     /note="27kD protein (AA 1 to 241)"
                     /codon_start=378
     misc_feature    1318..4128
                     /note="104kD protein (AA 1 to 937)"
     CDS             1858..3576
                     /note="63kD protein (AA 1 to 573)"
                     /codon_start=1858
     CDS             2266..3576
                     /note="48kD protein (AA 1 to 437)"
                     /codon_start=2266
     CDS             2668..3576
                     /note="33kD protein (AA 1 to 303)"
                     /codon_start=2668
     CDS             3031..3576
                     /note="20kD protein (AA 1 to 182)"
                     /codon_start=3031
     misc_feature    3577..3579
                     /note="Amber stop codon"
     CDS             4153..4656
                     /note="precursor polypeptide (AA -22 to 146)"
                     /codon_start=4153
     CDS             4153..4218
                     /note="signal peptide (AA -22 to -1)"
                     /codon_start=4153
     CDS             4219..4656
                     /note="CS3 pilin (AA 1 to 146)"
                     /codon_start=4219
BASE COUNT     1588 a    745 c    957 g   1456 t
ORIGIN
        1 aagcttcacg acatagcggg gaggtttgct tctttgagag gcgggtttac gtttacgggg
       61 tttagctgaa cgggccatat aaccacctga aagacaatga catttcctgt ttttataacg
      121 gtaattgcag accatgacaa gccacagccg tcaggctgtc tactcggcat tgttatctct
      181 ttaaaacatt gaggtgaagc tatgctgaca caggaggtaa ttacccaatc tgaataagaa
      241 ttattgggtg atctcctccc atgaaaatac gcacgcgaga agtgatatag atggaatgtt
      301 gtgttttttt atcaaaatta tatttgttta tggagtatta taacaataag ttattgacgc
      361 ttatgctagg agaaagaatg acacctatta agctaatttt tgcagctctg tctttatttc
      421 catgcagtaa catttatgca aacaatataa ccactcagaa attcgaagct atattgggtg
      481 caacaagagt aatttaccac ctagatggta atggtgaaag tctaagagtt aaaaatccgc
      541 agattagtcc aattctaatt caatctaaag taatggacga gggtagtaaa gataatgcgg
      601 attttattgt taccccccct ctttttagac tagatgcaaa aagagaaact gacattcgta
      661 tagttatggt gaatggctta tacccaaaag acagggaatc tctaaagacc ctctgtgtgc
      721 gaggaattcc accaaaacaa ggagatttat gggctaacaa tgaaaaagaa tttgttggaa
      781 tgaaacttaa cgtttcaatt aacacatgta ttaaattaat attaagacca cataatcttc
      841 ctaaacttga tattaattcc gaagggcaga tagaatgggg gataagggat ggtaatttag
      901 tagcaaagaa taaaacacct tactatttta ctatagtaaa tgcatcgttt aatggaaagg
      961 cactcaaaac accggggacg ctagggccgt atgagcaaaa actttacacg ctacctagta
     1021 aaatttctgt atctggactg gtaaagtggg aaattattgg tgatctaggt gagagcagtg
     1081 aaacaaagaa attcaatatt tgaagaatta aaagtgtact aaaaactgtc gagctaaact
     1141 attcgtacta ttatttttat gtgattctgt taatgcagaa aaatatatat ttgagcgaga
     1201 tttccttgct gattctgaaa aaattgattt aacattattg gagtcaagtg cctacccctc
     1261 tggtcgttat tatgttagtt tgtatttgaa tggggaatac attacaaaag aatgatgatg
     1321 tactttgacg ctggagaaag tgaggatttt tgtattcagt actctgtact acaggatata
     1381 ggtgtaactg tgagtgggaa tcaggatgaa tgtgcaaatc ttgatgatga attaaactta
     1441 agaaccaggt ttgattttta ctcgaaaaga atggatattt ttgtatcacc aaagtttgtt
     1501 ccacgaaaaa aaaacggtct tgcgccaatt aaactttggg atgagggtga aaatgcgcta
     1561 ttcacaagtt acaactttag tgaggattat taccatttta aaggtgacgc aagagatagt
     1621 tattcacaat acgctaacat tcaaccacgc ttaaatatag gaccatggag aataagaact
     1681 caagccatat ggaataaaaa taataacaca aaaggggagt ggagtaataa ttacctgtat
     1741 gccgaaagag gcttaggaaa tataaagagt agactataca ttggggatgg atattttcca
     1801 ttaaaaaact ttaattcgtt caaatttaaa ggaggggtgc taaaaactga tgagaatatg
     1861 tatccctatt cagaaaaaac ttattcacca atagttaaag gctcggcaaa aactcaagca
     1921 aaagttgaat tttttcagga tggtgtaaaa atttatagct caatcgtccc tccaggggat
     1981 ttttctatct cagattatat tttatcaggc tcaaatagtg atctttatgt caaagttata
     2041 gaggaaaatg gctcaattca ggaatttatc gttccattta cctatcctgc agttgcggtc
     2101 cgggaaggat ttacctatta tgaaatcgct atgggagaga ctcagcagtc gaatgattat
     2161 tttacacagt tatcatttac tcgtgggctt ccatatgact ttaccgtact tacatcttta
     2221 gaatattctg gcttctacag atctcttgaa attgggttag ggaaaatgct tgggaatttg
     2281 ggcgcattat cgttaatcta tggacagtca aactttagta aaagtgataa tagtaaaaat
     2341 aaaaaatggg atatcagata taataaaaat attccggacc taaatacata tttgagtttt
     2401 tctgctgtta gccaaactag aggggggtat tcttcactca gggatgcttt ggactatgag
     2461 atcggagaat atacttttaa ctcaaaaaac tcctatacag cctcaataaa ccactcatta
     2521 ggagagcttg gtagtttaaa ctttagtgga acatggcgaa actactggga gaataagaac
     2581 caaaccagat cttacaattt atcatattct acacaaatct ttaatggaaa ggcctacttg
     2641 tcaggaagtt tgattagaag tgaacttatg aattttaata ataagataag tgatactatt
     2701 ttaaatatcg gtgttaatat tccctttggc ctttctcgtg gcattcaatc tgtaagttat
     2761 aacaccagtt cagtgaaagg ggggaggagt actcatcagt tagggataag tggttctgaa
     2821 tttgacaata aattgtactg gcatgtaaat cagggttact cagataatta cagtaatacc
     2881 tctatgtatg gttattataa agctaagtat gctcaggtta atgccggata ctcagtttct
     2941 gagagataca atcatgctta tggaggtata gagggaggaa ttctggtata tgacggtgga
     3001 attattttag gtcgcaatct tggtgataca atgtcaatta ttgaagctcc aggtgcggaa
     3061 aatacaaaga ttagaggatg gggatcgatt gaaactgatt ggagggggag ggcttttatt
     3121 ggttatcttt caccttacca aaataatgat atatcccttg acccatcatc attaccatta
     3181 gactcctctt tagatatcac aacaaattcg gttattccaa caactggtgc aattgttaaa
     3241 acgacatata atgttaaaaa aggaaaaaaa gtaatgctta ctttaaaaaa gtcaaatggt
     3301 gatgcagttc catttggagc aattgtgaca gttatggatg gcgatcaaaa tacaagcatt
     3361 gtgggcgata atgggcaatt gtatttaggt tcctcaatgg atacaggaag gctaaaagtt
     3421 atatggggaa atggcgaaga taaaaaatgt gttgttgact acatagtagg tgacaataaa
     3481 aatatagcgg gtatttatat aggcagtgcc gaacatgtat ttagctcaat gctcctttat
     3541 ggcaaaaaaa tatctttttt atccgcttct gtttggtagg ttataggtgt tgttaaagcg
     3601 tttctgacaa ctctgcaatc caataacgaa tggagaacac acagtgaaaa aaatgatttt
     3661 agcattgact ttgatgtcgg tgtggggagg tcgtttgccg cagtgggccc aacgaaagat
     3721 atgagtttag gtgcaaattt aacttcagag cctacattag ctattgattt tacgcctatt
     3781 gaaaatattt atgtaggtgc caattatggt aaagatattg gaacccttgt tttcacaaca
     3841 aatgatttaa cagatattac attgatgtca tctcgcagcg ttgttgatgg tcgccagact
     3901 ggttttttta ccttcatgga ctcatcagcc acttacaaaa ttagtacaaa actgggatca
     3961 tcgaatgatg taaacattca agaaattact caaggagcta aaattactcc tgttagtgga
     4021 gagaaaactt tgcctaaaaa attcactctt aagctacatg cacacaggag tagcagtaca
     4081 gttccagata cgtatactgt tggtcttaac gtaaccagta atgttattta aagtgaatgt
     4141 atgagggatt cgatgttaaa aataaaatac ttattaatag gtctttcact gtcagctatg
     4201 agttcatact cactagctgc agcggggccc actctaacca aagaactggc attaaatgtg
     4261 ctttctcctg cagctctgga tgcaacttgg gctcctcagg ataatttaac attatccaat
     4321 actggcgttt ctaatacttt ggtgggtgtt ttgactcttt caaataccag tattgataca
     4381 gttagcattg cgagtacaaa tgtttctgat acatctaaga atggtacagt aacttttgca
     4441 catgagacaa ataactctgc tagctttgcc accaccattt caacagataa tgccaacatt
     4501 acgttggata aaaatgctgg aaatacgatt gttaaaacta caaatgggag tcagttgcca
     4561 actaatttac cacttaagtt tattaccact gaaggtaacg aacatttagt ttcaggtaat
     4621 taccgtgcaa atataacaat tacttcgaca attaaataat tatataatag acgtagcctt
     4681 cgaaataaag gctacgttgc tatctttatg tttgtgattt ataggcatca ttaaatagtc
     4741 aagctt
//
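
The COMMENT block says the amber stop codon at 3577-3579 can be read through to give the 104kD protein. The sketch below is not part of the GenBank entry. It is a small consistency check, assuming only the coordinates in the FEATURES table and the ORIGIN row that starts at base 3541. It checks that the four CDS features ending at 3576 and the amber codon all share the reading frame of the 1318..4128 span. The names READTHROUGH, AMBER, CDS, codons and in_frame are hypothetical and chosen for illustration.

```python
# Coordinates copied from the FEATURES table (1-based, inclusive).
READTHROUGH = (1318, 4128)   # misc_feature: "104kD protein (AA 1 to 937)"
AMBER = (3577, 3579)         # misc_feature: "Amber stop codon"
CDS = {                      # label: (start, end, amino acids given in /note)
    "63kD": (1858, 3576, 573),
    "48kD": (2266, 3576, 437),
    "33kD": (2668, 3576, 303),
    "20kD": (3031, 3576, 182),
}


def codons(start, end):
    """Return the number of whole codons in a 1-based inclusive span."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError(f"span {start}..{end} is not a multiple of 3")
    return length // 3


def in_frame(pos, frame_start):
    """True if pos is the first base of a codon in the frame set by frame_start."""
    return (pos - frame_start) % 3 == 0


# ORIGIN row that begins at base 3541, with the spacing removed.
ROW_START = 3541
ROW = "ggcaaaaaaatatcttttttatccgcttctgtttggtaggttataggtgttgttaaagcg"

# Slice out bases 3577..3579 from that row.
amber_codon = ROW[AMBER[0] - ROW_START: AMBER[1] - ROW_START + 1]

# Length of the read-through span in codons, and the 1-based index of the
# amber codon within that span.
readthrough_codons = codons(*READTHROUGH)
amber_index = (AMBER[0] - READTHROUGH[0]) // 3 + 1

# Each CDS: does its length match the /note, and is it in the same frame?
cds_report = {
    name: (codons(start, end) == aa, in_frame(start, READTHROUGH[0]))
    for name, (start, end, aa) in CDS.items()
}

if __name__ == "__main__":
    print("amber codon:", amber_codon)
    print("read-through span, codons:", readthrough_codons)
    print("amber codon index in span:", amber_index)
    for name, (length_ok, frame_ok) in cds_report.items():
        print(name, "length matches /note:", length_ok, "in frame:", frame_ok)
```

Under these assumptions the bases at 3577-3579 are tag, the 1318..4128 span is 937 codons long, and the amber codon is codon 754 of that span. The check says nothing about how read-through occurs in the cell. It only confirms that the annotated coordinates agree with one another.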