GenBank-Updates@genbank.bio.net (09/23/90)
LOCUS ECOFDHF 2971 bp ds-DNA BCT 23-SEP-1990
DEFINITION E.coli fdhF gene encoding the selenopolypeptide of the
benzylviologen-linked formate dehydrogenase, complete cds.
ACCESSION M13563 M18632
KEYWORDS anaerobically induced protein; dehydrogenase; fdhF gene;
formate dehydrogenase; readthrough.
SOURCE E.coli (MC4100) DNA, clone pFM20.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 699 to 2971)
AUTHORS Zinoni,F., Birkmann,A., Stadtman,T.C. and Bock,A.
TITLE Nucleotide sequence and expression of the selenocysteine-
containing polypeptide of formate dehydrogenase
(formate-hydrogen-lyase-linked) from Escherichia coli
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83, 4650-4654 (1986)
STANDARD full staff_review
REFERENCE 2 (bases 1 to 784)
AUTHORS Birkmann,A., Zinoni,F., Sawers,G. and Boeck,A.
TITLE Factors affecting transcriptional regulation of the formate-
hydrogen-lyase pathway of Escherichia coli
JOURNAL Arch. Microbiol. 148, 44-51 (1987)
STANDARD simple staff_review
COMMENT A ribosome binding site is located at positions 738-741 and a
rho-independent transcription termination structure at positions
2911-2932.
A nonsense codon ("tga") is located at positions 1166-1168 in the
middle of the coding region. However, E.coli (MC4100) actively
forms gas and exhibits wild-type-like benzylviologen-linked formate
dehydrogenase activity under anaerobic conditions. Experimental
evidence proved that the opal stop codon is translated in this
gene. The selenocysteine is probably inserted into the protein
co-translationally by suppression of the opal nonsense codon.
FEATURES from to/span description
pept 749 1165 formate dehydrogenase
/transl_except=(pos:1166-1168,aa:OTHER)
/note="selenocysteine" /nomgen="fdhF"
1169 2896 formate dehydrogenase
mRNA 708 > 2896 formate dehydrogenase mRNA
rpt 1126 1145 inverted repeat
rpt 1159 1173 inverted repeat
rpt 2910 2931 inverted repeat
site 1 100 putative VECTOR sequence pBR322
BASE COUNT 723 a 771 c 842 g 635 t
ORIGIN SmaI site.
1 gggcgctgcc ggcacctgtc ctacgagttg catgataaag aagacagtca taagtgcggc
61 gacgatagtc atgccccgcg cccaccggaa ggagctaccg gcagcggtgc ggactgttgt
121 aactcagaat aagaaatgag gccgctcatg gcgttggtct gaaattgccg ctgtttgacg
181 gtggacggtt gaatgccaat ctcgaaggca cgcgcgccgc cagcaacatg atgattgaac
241 gttacaacca gtcagtactg aacgcggtgc gtgacgttgc cgtcaacggc acgcgtctgc
301 aaacgctcaa cgacgagcga gaaatgcagg ctgaacgcgt ggaagccacg cgctttaccc
361 agcgcgctgc cgaggccgcc tatcagcgcg gcttaaccag ccgcttacag gccaccgaag
421 cccggttgcc agtgcttgcc gaagagatgt cattactgat gctggacagc cgccgggtga
481 tccaaagcat tcagttgatg aaatcgctgg gcggcgggta tcaggcaggt cccgtcgtcg
541 agaaaaaata aaatgtctgc cgcgtgatgg ctgtcacgcg gtatttcgtt tcgtcacgtc
601 aaaactgacg acagcctgtt tttcgtcaga gttttgaata aatagtgccc gtaatatcag
661 ggaatgaccc cacataaaat gtggcataaa agatgcatac tgtagtcgag agcgcgtatg
721 cgtgatttga ttaactggag cgagaccgat gaaaaaagtc gtcacggttt gcccctattg
781 cgcatcaggt tgcaaaatca acgtggtcgt cgataacggc aaaatcgtcc gggcggaggc
841 agcgcagggg aaaaccaacc agggtaccct gtgtctgaag ggttattatg gctgggactt
901 cattaacgat acccagatcc tgaccccgcg cctgaaaacc cccatgatcc gtcgccagcg
961 tggcggcaaa ctcgaacctg tttcctggga tgaggcactg aattacgttg ccgagcgcct
1021 gagcgccatc aaagagaagt acggtccgga tgccatccag acgaccggct cctcgcgtgg
1081 tacgggtaac gaaaccaact atgtaatgca aaaatttgcg cgcgccgtta ttggtaccaa
1141 taacgttgac tgctgcgctc gtgtctgaca cggcccatcg gttgcaggtc tgcaccaatc
1201 ggtcggtaat ggcgcaatga gcaatgctat taacgaaatt gataataccg atttagtgtt
1261 cgttttcggg tacaacccgg cggattccca cccaatcgtg gcgaatcacg taattaacgc
1321 taaacgtaac ggggcgaaaa ttatcgtctg cgatccgcgc aaaattgaaa ccgcgcgcat
1381 tgctgacatg cacattgcac tgaaaaacgg ctcgaacatc gcgctgttga atgcgatggg
1441 ccatgtcatt attgaagaaa atctgtacga caaagcgttc gtcgcttcac gtacagaagg
1501 ctttgaagag tatcgtaaaa tcgttgaagg ctacacgccg gagtcggttg aagatatcac
1561 cggcgtcagc gccagtgaga ttcgtcaggc ggcacggatg tatgcccagg cgaaaagcgc
1621 cgccatcctg tggggcatgg gtgtaaccca gttctaccag ggcgtggaaa ccgtgcgttc
1681 tctgaccagc ctcgcgatgc tgaccggtaa cctcggtaag ccgcatgcgg gtgttaaccc
1741 ggttcgtggt cagaacaacg ttcagggtgc ctgcgatatg ggcgcgctgc cggatacgta
1801 tccgggatac cagtacgtga aagatccggc taaccgcgag aaattcgcca aagcctgggg
1861 cgtggaaagc ctgccagcgc ataccggcta tcgcatcagc gagctgccgc accgcgcagc
1921 gcatggcgaa gtgcgtgccg cgtacattat gggcgaagat ccgctacaaa ctgacgcgga
1981 gctgtcggca gtacgtaaag cctttgaaga tctggaactg gttatcgttc aggacatctt
2041 tatgaccaaa accgcgtcgg cggcggatgt tattttaccg tcaacgtcgt ggggcgagca
2101 tgaaggcgtg tttactgcgg ctgaccgtgg cttccagcgt ttcttcaagg cggttgaacc
2161 gaaatgggat ctgaaaacgg actggcaaat catcagtgaa atcgccaccc gtatgggtta
2221 tccgatgcac tacaacaaca cccaggagat ctgggatgag ttgcgtcatc tgtgcccgga
2281 tttctacggt gcgacttacg agaaaatggg cgaactgggc ttcattcagt ggccttgccg
2341 cgatacttca gatgccgatc aggggacttc ttatctgttt aaagagaagt ttgatacccc
2401 gaacggtctg gcgcagttct tcacctgcga ctgggtagcg ccaatcgaca aactcaccga
2461 cgagtacccg atggtactgt caacggtgcg tgaagttggt cactactctt gccgttcgat
2521 gaccggtaac tgtgcggcac tggcggcgct ggctgatgaa cctggctacg cacaaatcaa
2581 taccgaagac gccaaacgtc tgggtattga agatgaggca ttggtttggg tgcactcgcg
2641 taaaggcaaa attatcaccc gtgcgcaggt cagcgatcgt ccgaacaaag gggcgattta
2701 catgacctac cagtggtgga ttggtgcctg taacgagctg gttaccgaaa acttaagccc
2761 gattacgaaa acgccggagt acaaatactg cgccgttcgc gtcgagccga tcgccgatca
2821 gcgcgccgcc gagcagtacg tgattgacga gtacaacaag ttgaaaactc gcctgcgcga
2881 agcggcactg gcgtaatacc gtcctttcta cagcctcctt tcggaggctg tttttttatc
2941 cattcgaact ctttatactg gttacttccc g
//