[bionet.molbio.genbank.updates] ACCESSION M13563 M18632

GenBank-Updates@genbank.bio.net (09/23/90)

LOCUS       ECOFDHF      2971 bp ds-DNA             BCT       23-SEP-1990
DEFINITION  E.coli fdhF gene encoding the selenopolypeptide of the
            benzylviologen-linked formate dehydrogenase, complete cds.
ACCESSION   M13563 M18632
KEYWORDS    anaerobically induced protein; dehydrogenase; fdhF gene;
            formate dehydrogenase; readthrough.
SOURCE      E.coli (MC4100) DNA, clone pFM20.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 699 to 2971)
  AUTHORS   Zinoni,F., Birkmann,A., Stadtman,T.C. and Bock,A.
  TITLE     Nucleotide sequence and expression of the selenocysteine-
            containing polypeptide of formate dehydrogenase
            (formate-hydrogen-lyase-linked) from Escherichia coli
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 83, 4650-4654 (1986)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 784)
  AUTHORS   Birkmann,A., Zinoni,F., Sawers,G. and Boeck,A.
  TITLE     Factors affecting transcriptional regulation of the formate-
            hydrogen-lyase pathway of Escherichia coli
  JOURNAL   Arch. Microbiol. 148, 44-51 (1987)
  STANDARD  simple staff_review
COMMENT     A ribosome binding site is located at positions 738-741 and a
            rho-independent transcription termination structure at positions
            2911-2932.
            
            A nonsense codon ("tga") is located at positions 1166-1168 in the
            middle of the coding region.  However, E.coli (MC4100) actively
            forms gas and exhibits wild-type-like benzylviologen-linked formate
            dehydrogenase activity under anaerobic conditions.  Experimental
            evidence proved that the opal stop codon is translated in this
            gene.  The selenocysteine is probably inserted into the protein
            co-translationally by suppression of the opal nonsense codon.
FEATURES       from  to/span     description
    pept        749     1165     formate dehydrogenase
                                 /transl_except=(pos:1166-1168,aa:OTHER)
                                 /note="selenocysteine" /nomgen="fdhF"
               1169     2896     formate dehydrogenase
    mRNA        708  >  2896     formate dehydrogenase mRNA
    rpt        1126     1145     inverted repeat
    rpt        1159     1173     inverted repeat
    rpt        2910     2931     inverted repeat
    site          1      100     putative VECTOR sequence pBR322
BASE COUNT      723 a    771 c    842 g    635 t
ORIGIN      SmaI site.
        1 gggcgctgcc ggcacctgtc ctacgagttg catgataaag aagacagtca taagtgcggc
       61 gacgatagtc atgccccgcg cccaccggaa ggagctaccg gcagcggtgc ggactgttgt
      121 aactcagaat aagaaatgag gccgctcatg gcgttggtct gaaattgccg ctgtttgacg
      181 gtggacggtt gaatgccaat ctcgaaggca cgcgcgccgc cagcaacatg atgattgaac
      241 gttacaacca gtcagtactg aacgcggtgc gtgacgttgc cgtcaacggc acgcgtctgc
      301 aaacgctcaa cgacgagcga gaaatgcagg ctgaacgcgt ggaagccacg cgctttaccc
      361 agcgcgctgc cgaggccgcc tatcagcgcg gcttaaccag ccgcttacag gccaccgaag
      421 cccggttgcc agtgcttgcc gaagagatgt cattactgat gctggacagc cgccgggtga
      481 tccaaagcat tcagttgatg aaatcgctgg gcggcgggta tcaggcaggt cccgtcgtcg
      541 agaaaaaata aaatgtctgc cgcgtgatgg ctgtcacgcg gtatttcgtt tcgtcacgtc
      601 aaaactgacg acagcctgtt tttcgtcaga gttttgaata aatagtgccc gtaatatcag
      661 ggaatgaccc cacataaaat gtggcataaa agatgcatac tgtagtcgag agcgcgtatg
      721 cgtgatttga ttaactggag cgagaccgat gaaaaaagtc gtcacggttt gcccctattg
      781 cgcatcaggt tgcaaaatca acgtggtcgt cgataacggc aaaatcgtcc gggcggaggc
      841 agcgcagggg aaaaccaacc agggtaccct gtgtctgaag ggttattatg gctgggactt
      901 cattaacgat acccagatcc tgaccccgcg cctgaaaacc cccatgatcc gtcgccagcg
      961 tggcggcaaa ctcgaacctg tttcctggga tgaggcactg aattacgttg ccgagcgcct
     1021 gagcgccatc aaagagaagt acggtccgga tgccatccag acgaccggct cctcgcgtgg
     1081 tacgggtaac gaaaccaact atgtaatgca aaaatttgcg cgcgccgtta ttggtaccaa
     1141 taacgttgac tgctgcgctc gtgtctgaca cggcccatcg gttgcaggtc tgcaccaatc
     1201 ggtcggtaat ggcgcaatga gcaatgctat taacgaaatt gataataccg atttagtgtt
     1261 cgttttcggg tacaacccgg cggattccca cccaatcgtg gcgaatcacg taattaacgc
     1321 taaacgtaac ggggcgaaaa ttatcgtctg cgatccgcgc aaaattgaaa ccgcgcgcat
     1381 tgctgacatg cacattgcac tgaaaaacgg ctcgaacatc gcgctgttga atgcgatggg
     1441 ccatgtcatt attgaagaaa atctgtacga caaagcgttc gtcgcttcac gtacagaagg
     1501 ctttgaagag tatcgtaaaa tcgttgaagg ctacacgccg gagtcggttg aagatatcac
     1561 cggcgtcagc gccagtgaga ttcgtcaggc ggcacggatg tatgcccagg cgaaaagcgc
     1621 cgccatcctg tggggcatgg gtgtaaccca gttctaccag ggcgtggaaa ccgtgcgttc
     1681 tctgaccagc ctcgcgatgc tgaccggtaa cctcggtaag ccgcatgcgg gtgttaaccc
     1741 ggttcgtggt cagaacaacg ttcagggtgc ctgcgatatg ggcgcgctgc cggatacgta
     1801 tccgggatac cagtacgtga aagatccggc taaccgcgag aaattcgcca aagcctgggg
     1861 cgtggaaagc ctgccagcgc ataccggcta tcgcatcagc gagctgccgc accgcgcagc
     1921 gcatggcgaa gtgcgtgccg cgtacattat gggcgaagat ccgctacaaa ctgacgcgga
     1981 gctgtcggca gtacgtaaag cctttgaaga tctggaactg gttatcgttc aggacatctt
     2041 tatgaccaaa accgcgtcgg cggcggatgt tattttaccg tcaacgtcgt ggggcgagca
     2101 tgaaggcgtg tttactgcgg ctgaccgtgg cttccagcgt ttcttcaagg cggttgaacc
     2161 gaaatgggat ctgaaaacgg actggcaaat catcagtgaa atcgccaccc gtatgggtta
     2221 tccgatgcac tacaacaaca cccaggagat ctgggatgag ttgcgtcatc tgtgcccgga
     2281 tttctacggt gcgacttacg agaaaatggg cgaactgggc ttcattcagt ggccttgccg
     2341 cgatacttca gatgccgatc aggggacttc ttatctgttt aaagagaagt ttgatacccc
     2401 gaacggtctg gcgcagttct tcacctgcga ctgggtagcg ccaatcgaca aactcaccga
     2461 cgagtacccg atggtactgt caacggtgcg tgaagttggt cactactctt gccgttcgat
     2521 gaccggtaac tgtgcggcac tggcggcgct ggctgatgaa cctggctacg cacaaatcaa
     2581 taccgaagac gccaaacgtc tgggtattga agatgaggca ttggtttggg tgcactcgcg
     2641 taaaggcaaa attatcaccc gtgcgcaggt cagcgatcgt ccgaacaaag gggcgattta
     2701 catgacctac cagtggtgga ttggtgcctg taacgagctg gttaccgaaa acttaagccc
     2761 gattacgaaa acgccggagt acaaatactg cgccgttcgc gtcgagccga tcgccgatca
     2821 gcgcgccgcc gagcagtacg tgattgacga gtacaacaag ttgaaaactc gcctgcgcga
     2881 agcggcactg gcgtaatacc gtcctttcta cagcctcctt tcggaggctg tttttttatc
     2941 cattcgaact ctttatactg gttacttccc g
//