GenBank-Updates@genbank.bio.net (09/23/90)
LOCUS       ECOFDHF      2971 bp ds-DNA             BCT       23-SEP-1990
DEFINITION  E.coli fdhF gene encoding the selenopolypeptide of the
            benzylviologen-linked formate dehydrogenase, complete cds.
ACCESSION   M13563 M18632
KEYWORDS    anaerobically induced protein; dehydrogenase; fdhF gene; formate
            dehydrogenase; readthrough.
SOURCE      E.coli (MC4100) DNA, clone pFM20.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively
            anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 699 to 2971)
  AUTHORS   Zinoni,F., Birkmann,A., Stadtman,T.C. and Bock,A.
  TITLE     Nucleotide sequence and expression of the selenocysteine-
            containing polypeptide of formate dehydrogenase
            (formate-hydrogen-lyase-linked) from Escherichia coli
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 83, 4650-4654 (1986)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 784)
  AUTHORS   Birkmann,A., Zinoni,F., Sawers,G. and Boeck,A.
  TITLE     Factors affecting transcriptional regulation of the formate-
            hydrogen-lyase pathway of Escherichia coli
  JOURNAL   Arch. Microbiol. 148, 44-51 (1987)
  STANDARD  simple staff_review
COMMENT     A ribosome binding site is located at positions 738-741 and a
            rho-independent transcription termination structure at positions
            2911-2932. A nonsense codon ("tga") is located at positions
            1166-1168 in the middle of the coding region. However, E.coli
            (MC4100) actively forms gas and exhibits wild-type-like
            benzylviologen-linked formate dehydrogenase activity under
            anaerobic conditions. Experimental evidence proved that the opal
            stop codon is translated in this gene. The selenocysteine is
            probably inserted into the protein co-translationally by
            suppression of the opal nonsense codon.
FEATURES    from  to/span    description
    pept     749   1165      formate dehydrogenase
                             /transl_except=(pos:1166-1168,aa:OTHER)
                             /note="selenocysteine" /nomgen="fdhF"
            1169   2896      formate dehydrogenase
    mRNA     708 > 2896      formate dehydrogenase mRNA
    rpt     1126   1145      inverted repeat
    rpt     1159   1173      inverted repeat
    rpt     2910   2931      inverted repeat
    site       1    100      putative VECTOR sequence pBR322
BASE COUNT      723 a    771 c    842 g    635 t
ORIGIN      SmaI site.
        1 gggcgctgcc ggcacctgtc ctacgagttg catgataaag aagacagtca taagtgcggc
       61 gacgatagtc atgccccgcg cccaccggaa ggagctaccg gcagcggtgc ggactgttgt
      121 aactcagaat aagaaatgag gccgctcatg gcgttggtct gaaattgccg ctgtttgacg
      181 gtggacggtt gaatgccaat ctcgaaggca cgcgcgccgc cagcaacatg atgattgaac
      241 gttacaacca gtcagtactg aacgcggtgc gtgacgttgc cgtcaacggc acgcgtctgc
      301 aaacgctcaa cgacgagcga gaaatgcagg ctgaacgcgt ggaagccacg cgctttaccc
      361 agcgcgctgc cgaggccgcc tatcagcgcg gcttaaccag ccgcttacag gccaccgaag
      421 cccggttgcc agtgcttgcc gaagagatgt cattactgat gctggacagc cgccgggtga
      481 tccaaagcat tcagttgatg aaatcgctgg gcggcgggta tcaggcaggt cccgtcgtcg
      541 agaaaaaata aaatgtctgc cgcgtgatgg ctgtcacgcg gtatttcgtt tcgtcacgtc
      601 aaaactgacg acagcctgtt tttcgtcaga gttttgaata aatagtgccc gtaatatcag
      661 ggaatgaccc cacataaaat gtggcataaa agatgcatac tgtagtcgag agcgcgtatg
      721 cgtgatttga ttaactggag cgagaccgat gaaaaaagtc gtcacggttt gcccctattg
      781 cgcatcaggt tgcaaaatca acgtggtcgt cgataacggc aaaatcgtcc gggcggaggc
      841 agcgcagggg aaaaccaacc agggtaccct gtgtctgaag ggttattatg gctgggactt
      901 cattaacgat acccagatcc tgaccccgcg cctgaaaacc cccatgatcc gtcgccagcg
      961 tggcggcaaa ctcgaacctg tttcctggga tgaggcactg aattacgttg ccgagcgcct
     1021 gagcgccatc aaagagaagt acggtccgga tgccatccag acgaccggct cctcgcgtgg
     1081 tacgggtaac gaaaccaact atgtaatgca aaaatttgcg cgcgccgtta ttggtaccaa
     1141 taacgttgac tgctgcgctc gtgtctgaca cggcccatcg gttgcaggtc tgcaccaatc
     1201 ggtcggtaat ggcgcaatga gcaatgctat taacgaaatt gataataccg atttagtgtt
     1261 cgttttcggg tacaacccgg cggattccca cccaatcgtg gcgaatcacg taattaacgc
     1321 taaacgtaac ggggcgaaaa ttatcgtctg cgatccgcgc aaaattgaaa ccgcgcgcat
     1381 tgctgacatg cacattgcac tgaaaaacgg ctcgaacatc gcgctgttga atgcgatggg
     1441 ccatgtcatt attgaagaaa atctgtacga caaagcgttc gtcgcttcac gtacagaagg
     1501 ctttgaagag tatcgtaaaa tcgttgaagg ctacacgccg gagtcggttg aagatatcac
     1561 cggcgtcagc gccagtgaga ttcgtcaggc ggcacggatg tatgcccagg cgaaaagcgc
     1621 cgccatcctg tggggcatgg gtgtaaccca gttctaccag ggcgtggaaa ccgtgcgttc
     1681 tctgaccagc ctcgcgatgc tgaccggtaa cctcggtaag ccgcatgcgg gtgttaaccc
     1741 ggttcgtggt cagaacaacg ttcagggtgc ctgcgatatg ggcgcgctgc cggatacgta
     1801 tccgggatac cagtacgtga aagatccggc taaccgcgag aaattcgcca aagcctgggg
     1861 cgtggaaagc ctgccagcgc ataccggcta tcgcatcagc gagctgccgc accgcgcagc
     1921 gcatggcgaa gtgcgtgccg cgtacattat gggcgaagat ccgctacaaa ctgacgcgga
     1981 gctgtcggca gtacgtaaag cctttgaaga tctggaactg gttatcgttc aggacatctt
     2041 tatgaccaaa accgcgtcgg cggcggatgt tattttaccg tcaacgtcgt ggggcgagca
     2101 tgaaggcgtg tttactgcgg ctgaccgtgg cttccagcgt ttcttcaagg cggttgaacc
     2161 gaaatgggat ctgaaaacgg actggcaaat catcagtgaa atcgccaccc gtatgggtta
     2221 tccgatgcac tacaacaaca cccaggagat ctgggatgag ttgcgtcatc tgtgcccgga
     2281 tttctacggt gcgacttacg agaaaatggg cgaactgggc ttcattcagt ggccttgccg
     2341 cgatacttca gatgccgatc aggggacttc ttatctgttt aaagagaagt ttgatacccc
     2401 gaacggtctg gcgcagttct tcacctgcga ctgggtagcg ccaatcgaca aactcaccga
     2461 cgagtacccg atggtactgt caacggtgcg tgaagttggt cactactctt gccgttcgat
     2521 gaccggtaac tgtgcggcac tggcggcgct ggctgatgaa cctggctacg cacaaatcaa
     2581 taccgaagac gccaaacgtc tgggtattga agatgaggca ttggtttggg tgcactcgcg
     2641 taaaggcaaa attatcaccc gtgcgcaggt cagcgatcgt ccgaacaaag gggcgattta
     2701 catgacctac cagtggtgga ttggtgcctg taacgagctg gttaccgaaa acttaagccc
     2761 gattacgaaa acgccggagt acaaatactg cgccgttcgc gtcgagccga tcgccgatca
     2821 gcgcgccgcc gagcagtacg tgattgacga gtacaacaag ttgaaaactc gcctgcgcga
     2881 agcggcactg gcgtaatacc gtcctttcta cagcctcctt tcggaggctg tttttttatc
     2941 cattcgaact ctttatactg gttacttccc g
//
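
The coordinates quoted in the COMMENT and FEATURES sections can be cross-checked with a few lines of Python. The sketch below is only an illustration, not part of the GenBank entry: the constant and function names are hypothetical, the numbers are copied from the record, and the embedded fragment is the single ORIGIN line that starts at position 1141. It checks that the BASE COUNT totals the locus length, that the span 749-2896 is a whole number of codons, and that the in-frame opal codon at 1166-1168 reads "tga".

```python
# Values copied from the record above (1-based, inclusive coordinates).
LOCUS_LENGTH = 2971
BASE_COUNT = {"a": 723, "c": 771, "g": 842, "t": 635}
CDS_START, CDS_END = 749, 2896      # first pept base .. last pept base
OPAL_START, OPAL_END = 1166, 1168   # /transl_except position

# ORIGIN line beginning at position 1141, with the spaces removed.
FRAGMENT_START = 1141
FRAGMENT = "taacgttgactgctgcgctcgtgtctgacacggcccatcggttgcaggtctgcaccaatc"


def slice_1based(fragment, fragment_start, start, end):
    """Return bases start..end (1-based, inclusive) from a fragment
    whose first base sits at position fragment_start."""
    offset = start - fragment_start
    return fragment[offset:offset + (end - start + 1)]


def codon_number(cds_start, position):
    """Return the 1-based codon number whose first base is `position`,
    or raise ValueError if `position` is not on a codon boundary."""
    offset = position - cds_start
    if offset % 3 != 0:
        raise ValueError("position is not in frame with the CDS start")
    return offset // 3 + 1


def check_record():
    """Collect the consistency checks described in the lead-in."""
    return {
        "base_count_matches_length": sum(BASE_COUNT.values()) == LOCUS_LENGTH,
        "cds_is_whole_codons": (CDS_END - CDS_START + 1) % 3 == 0,
        "opal_codon": slice_1based(FRAGMENT, FRAGMENT_START,
                                   OPAL_START, OPAL_END),
        "opal_codon_number": codon_number(CDS_START, OPAL_START),
    }


if __name__ == "__main__":
    for name, value in check_record().items():
        print(name, value)
```

Under these assumptions the opal codon falls on a codon boundary relative to the start at 749, which is consistent with the record's statement that it lies in the middle of the coding region and is read through rather than terminating translation.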