GenBank-Updates@genbank.bio.net (05/18/91)
LOCUS ECOPHOF 1980 bp ds-DNA BCT 18-MAY-1991
DEFINITION E. coli gene phoE encoding the phosphate limitation inducible outer
membrane pore protein.
ACCESSION V00316 J01662 X06652
KEYWORDS membrane protein; phoE gene; pore protein; proB gene.
SOURCE Escherichia coli DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 1980)
AUTHORS Overbeeke,N., Bergmans,H., van Mansfeld,F. and Lugtenberg,B.
TITLE Complete nucleotide sequence of phoE, the structural gene for the
phosphate limitation inducible outer membrane pore protein of
Escherichia coli K12
JOURNAL J. Mol. Biol. 163, 513-532 (1983)
STANDARD full automatic
REFERENCE 2 (bases 181 to 480)
AUTHORS Alexandrov,N.N. and Mironov,A.A.
TITLE Recognition of Escherichia coli promoters according to primary DNA
structure
JOURNAL Mol. Biol. (Mosk) 21, 204-210 (1987)
STANDARD full automatic
REFERENCE 3 (bases 221 to 489)
AUTHORS Tommassen,J., Koster,M. and Overduin,P.
TITLE Molecular analysis of the promoter region of the Escherichia coli
K-12 phoE gene; Identification of an element, upstream from the
promoter required for efficient expression of phoE protein
JOURNAL J. Mol. Biol. 198, 633-641 (1987)
STANDARD full automatic
REFERENCE 4 (bases 681 to 681)
AUTHORS Tommassen,J.
JOURNAL Unpublished (1988)
STANDARD full automatic
COMMENT SWISS-PROT; P02932; PHOE$ECOLI.
*source: strain=K12; clone=pJP300; library=pACYC184; Sequence
partially presented in 3' to 5' direction in [1] source information
is taken from [2] [3] does not give excplisively the range that was
sequenced, but reports the found difference [4] reports promoter
structure
Data kindly reviewed (13-APR-1983) by N. Overbeeke. Data kindly
reviewed (10-OCT-1988) by Tommassen J.
From EMBL 26 entry ECPHOE; dated 22-FEB-1991.
FEATURES Location/Qualifiers
precursor_RNA complement(<1..223)
/note="proB transcript [4]"
CDS complement(<1..187)
/note="coding sequence unknown protein proB [4]"
/codon_start=187
conflict 202..202
/note="c is g in [4]"
/citation=[2]
promoter complement(232..237)
/note="pot. -10 region [4]"
promoter complement(255..260)
/note="pot. -35 region [4]"
misc_feature complement(305..310)
/note="-10 region [2]"
misc_feature complement(321..337)
/note="pseudo-pho box [2]"
promoter 376..392
/note="pho box [2]"
promoter 404..409
/note="-10 region [2]"
promoter 415..420
/note="pot. -35 region [4]"
precursor_RNA 417..>416
/note="transcript [2]"
promoter 438..443
/note="pot. -10 region [4]"
precursor_RNA 448..>447
/note="transcript [4]"
RBS 465..468
/note="pot. ribosome binding site [2]"
CDS 475..1527
/product="phoE protein"
/gene="phoE"
/codon_start=475
CDS complement(1572..1970)
/note="coding sequence unknown protein"
/codon_start=1970
BASE COUNT 549 a 457 c 459 g 515 t
ORIGIN
1 ggtaacctag gtgctcacgt ccggcggcga tcgcgcccga cgtcacaata acaatccgat
61 gcccggcggc atgtaactgc gcgcactggc gaacaagttc aacgatatgg gcacggttca
121 gacggcgcga tccgcctgtt agcacactgg tgccgagttt taccaccagc gtctggctgt
181 cactcatgat tctctgccat tcaattttag gaaaaatgat atcaaacgaa cgttttagca
241 ggactgtcgt cggttgccaa ccatctgcga gcaaagcatg gcgttttgtt gcgcgggatc
301 agcaagccta gcggcagttg tttacgcttt tattacagat ttaataaatt accacatttt
361 aagaatatta ttaatctgta atatatcttt aacaatctca ggttaaaaac tttcctgttt
421 tcaacgggac tctcccgctg aatattcgcg cgttaattaa aatcaggaat gaaaatgaaa
481 aagagcactc tggcattagt ggtgatgggc attgtggcat ctgcatctgt acaggctgca
541 gaaatatata ataaagacgg taataaactg gatgtctatg gcaaagttaa agccatgcat
601 tatatgagtg ataacgccag taaagatggc gaccagagtt atatccgttt tggtttcaaa
661 ggcgaaacac aaattaacga ccaactgact ggttatggtc gttgggaagc agagtttgcc
721 ggtaataaag cagagagtga tactgcacag caaaaaacgc gtctcgcttt tgccgggttg
781 aaatataaag atttgggttc tttcgattat ggtcgtaacc tgggggcgtt gtatgacgtg
841 gaagcctgga ccgatatgtt cccggaattt ggtggcgatt cctcggcgca gaccgacaac
901 tttatgacca aacgcgccag cggtctggcg acgtatcgga acaccgactt cttcggcgtt
961 atcgatggcc tgaacttaac cctgcaatat caagggaaaa acgaaaaccg cgacgttaaa
1021 aagcaaaacg gcgatggctt cggcacgtca ttgacatatg actttggcgg cagcgatttc
1081 gccattagtg gggcctatac caactcagat cgcaccaacg agcagaacct gcaaagccgt
1141 ggcacaggca agcgtgcaga agcatgggca acaggtctga aatacgatgc caataatatt
1201 tatctggcaa ctttctattc tgaaacacgc aaaatgacgc caataactgg cggctttgcc
1261 aataagacac agaactttga agcggtcgct caataccagt ttgactttgg tctgcgtcca
1321 tcgctgggtt atgtcttatc gaaagggaaa gatattgaag gtatcggtga tgaagatctg
1381 gtcaattata tcgacgtcgg tgctacgtat tatttcaaca aaaatatgtc agcgtttgtt
1441 gattataaaa tcaaccaact ggatagcgat aacaaattga atattaataa tgatgatatt
1501 gtcgcggttg gcatgacgta tcagttttaa tgaatattgc cggatgtgat gcatccggca
1561 gatttcactc acgccgttaa cttcaccggc tcgtcacgaa aatcatccgc cggttccagc
1621 ttcagattca gcgtcgtcag cagttcacgc agcttctcgt gaaactcacg cagggtgtgc
1681 tccagtcgtt caaccacttc agtgtctttt accggaacac tcttccagtc gcctgcttta
1741 tcgaacagac caaactggta actgtaggta aaacgggatt cctgcgcttc cagctccatc
1801 caccagcccc agaattcacg cacttccggt gccggtttca cgttgacgca tacagccaga
1861 caatcgaaaa agaatcgatt atctttgcac ttaccttcac gaatatacgg gcctagtgcg
1921 gtaaattttt tgttcaatct gctcttcaaa tgtccactcg gtaacgtcat tgctatctcc
//