GenBank-Updates@genbank.bio.net (05/25/91)
LOCUS PT4T4ER1 8008 bp ds-DNA PHG 25-MAY-1991
DEFINITION Bacteriophage T4 DNA for 58.3 to 65.5 kb early region
ACCESSION X04567 V00860 X01124
KEYWORDS denV gene; e gene; endonuclease V; inverted repeat;
overlapping genes; signal peptide; tk gene;
unidentified reading frame.
SOURCE Bacteriophage T4 DNA.
ORGANISM Bacteriophage T4
Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE 1 (bases 1 to 750)
AUTHORS Owen,J.E., Schultz,D.W., Taylor,A. and Smith,G.R.
TITLE Nucleotide sequence of the lysozyme gene of bacteriophage T4.
Analysis of mutations involving repeated sequences
JOURNAL J. Mol. Biol. 165, 229-248 (1983)
STANDARD full automatic
REFERENCE 2 (bases 1685 to 2253)
AUTHORS Valerie,K., Henderson,E.E. and DeRiel,J.K.
TITLE Identification, physical map location and sequence of the denV gene
from bacteriophage T4
JOURNAL Nucleic Acids Res. 12, 8085-8096 (1984)
STANDARD full automatic
REFERENCE 3 (bases 1 to 8008)
AUTHORS Valerie,K., Stevens,J., Lynch,M., Henderson,E.E. and de,R.J.K.
TITLE Nucleotide sequence and analysis of the 58.3 to 65.5-Kb early
region of bacteriophage T4
JOURNAL Nucleic Acids Res. 14, 8637-8654 (1986)
STANDARD full automatic
COMMENT SWISS-PROT; P00720; LYCV$BPT4. SWISS-PROT; P03719; VIN2$BPT4.
SWISS-PROT; P04418; END5$BPT4. SWISS-PROT; P13300; KITH$BPT4.
SWISS-PROT; P13302; VIN3$BPT4. SWISS-PROT; P13303; Y583$BPT4.
SWISS-PROT; P13304; Y586$BPT4. SWISS-PROT; P13305; Y589$BPT4.
SWISS-PROT; P13306; Y597$BPT4. SWISS-PROT; P13307; Y599$BPT4.
SWISS-PROT; P13308; Y601$BPT4. SWISS-PROT; P13309; Y605$BPT4.
SWISS-PROT; P13310; Y609$BPT4. SWISS-PROT; P13311; Y614$BPT4.
SWISS-PROT; P13312; REGB$BPT4. SWISS-PROT; P13313; Y622$BPT4.
SWISS-PROT; P13314; Y625$BPT4. SWISS-PROT; P13315; Y627$BPT4.
SWISS-PROT; P13316; Y631$BPT4. SWISS-PROT; P13317; Y634$BPT4.
SWISS-PROT; P13318; Y640$BPT4.
From EMBL entry MYT4ER1; dated 19-SEP-1987.
FEATURES Location/Qualifiers
RBS 125..128
/note="pot. rRNA binding site (gene e)"
CDS 135..626
/note="gene e product (AA 1-184)"
/codon_start=135
conflict 544..544
/note="t is g in [2]"
/citation=[1]
promoter 549..554
/note="pot. -10 region (IpIII)"
promoter 615..620
/note="pot. -10 region (IpIII)"
promoter 625..630
/note="pot. -10 region (IpIII)"
terminator 650..673
/note="put. stem-loop structure; pot. transcription
terminator (gene e)"
repeat_unit 650..657
/note="inverted repeat A"
promoter 661..666
/note="pot. -10 region (IpIII)"
repeat_unit 666..673
/note="inverted repeat A'"
promoter 689..694
/note="pot. -10 region (IpIII)"
RBS 708..711
/note="pot. rRNA binding site (IpIII)"
CDS 717..1295
/note="precursor polypeptide (AA -10 to 183)"
/codon_start=717
CDS 717..746
/note="signal peptide (AA -10 to -1)"
/codon_start=717
CDS 747..1295
/note="mature internal protein IpIII (AA 1-183)"
/codon_start=747
promoter 1296..1301
/note="pot. -10 region (IpII)"
terminator 1342..1377
/note="put. stem-loop structure; pot. transcription
terminator (IpIII)"
repeat_unit 1342..1355
/note="inverted repeat B"
promoter 1357..1362
/note="pot. -10 region (IpII)"
repeat_unit 1364..1377
/note="inverted repeat B'"
RBS 1405..1408
/note="pot. rRNA binding site (IpII)"
CDS 1414..1713
/note="precursor polypeptide (AA -10 to 90)"
/codon_start=1414
CDS 1414..1443
/note="signal peptide (AA -10 to -1)"
/codon_start=1414
CDS 1444..1713
/note="mature internal protein IpII (AA 1-90)"
/codon_start=1444
terminator 1726..1768
/note="put. stem-loop structure; pot. transcription
terminator (IpII)"
repeat_unit 1726..1743
/note="imp. inverted repeat C"
promoter 1745..1750
/note="pot. -10 region (denV gene)"
repeat_unit 1753..1768
/note="imp. inverted repeat C'"
RBS 1763..1768
/note="pot. rRNA binding site (denV gene)"
CDS 1777..2190
/note="endonuclease V (AA 1-138)"
/codon_start=1777
RBS 2212..2215
/note="pot. rRNA binding site (ORF 64.0)"
CDS 2220..2876
/note="ORF 64.0 (AA 1-219)"
/codon_start=2220
RBS 2259..2262
/note="pot. 1st alternative rRNA binding site (ORF
64.0(1))"
CDS 2268..2876
/note="alternative ORF 64.0(1) (AA 1-203)"
/codon_start=2268
RBS 2285..2288
/note="alternative pot. rRNA binding site (ORF 64.0(2))"
CDS 2295..2876
/note="alternative ORF 64.0(3) (AA 1-190)"
/codon_start=2295
RBS 2331..2335
/note="alternative pot. rRNA binding site (ORF 64.0(4))"
CDS 2340..2876
/note="alternaitve ORF 64.0(4) (AA 1-179)"
/codon_start=2340
CDS 2376..2876
/note="alternative ORF 64.0(5) (AA 1-167)"
/codon_start=2376
promoter 2717..2722
/note="pot. -35 region (ORF 63.4)"
promoter 2736..2741
/note="pot. -10 region (ORF 63.4)"
CDS 2825..3202
/note="pot. alternative ORF 63.4 (AA 1-126)"
/codon_start=2825
RBS 2867..2870
/note="pot. rRNA binding site (ORF63.4)"
CDS 2876..3202
/note="ORF 63.4 (AA 1-109)"
/codon_start=2876
CDS 3213..3572
/note="ORF 63.1 (AA 1-120)"
/codon_start=3213
RBS 3565..3568
/note="pot. rRNA binding site (ORF 62.7)"
CDS 3575..3748
/note="ORF 62.7 (AA 1-58)"
/codon_start=3575
RBS 3776..3781
/note="pot. rRNA binding site (ORF 62.5)"
CDS 3788..4051
/note="ORF 62.5 (AA 1-88)"
/codon_start=3788
RBS 4041..4044
/note="pot. rRNA binding site (ORF 62.2)"
CDS 4054..4329
/note="ORF 62.2 (AA 1-92)"
/codon_start=4054
terminator 4334..4392
/note="put. stem-loop structure; pot. transcription
terminator (ORF 62.2)"
repeat_unit 4334..4355
/note="imp. inverted repeat D"
promoter 4359..4364
/note="pot. -10 region (ORF 61.9)"
repeat_unit 4373..4392
/note="imp. inverted repeat D'"
RBS 4381..4386
/note="pot. rRNA binding site (ORF 61.9)"
CDS 4392..4850
/note="ORF 61.9 (AA 1-153)"
/codon_start=4392
promoter 4821..4827
/note="pot. -35 region (ORF 61.4)"
promoter 4837..4842
/note="pot. -10 region (ORF 61.4)"
RBS 4852..4855
/note="pot. rRNA binding site (ORF 61.4)"
CDS 4861..5403
/note="ORF 61.4 (AA 1-181)"
/codon_start=4861
RBS 5387..5390
/note="pot. rRNA binding site (ORF 60.9)"
CDS 5399..5743
/note="ORF 60.9 (AA 1-115)"
/codon_start=5399
promoter 5703..5708
/note="pot. -35 region (ORF 60.5)"
promoter 5720..5725
/note="pot. -10 region (ORF 60.5)"
RBS 5728..5731
/note="pot. rRNA binding site (ORF 60.5)"
CDS 5743..6207
/note="ORF 60.5 (AA 1-155)"
/codon_start=5743
RBS 6194..6197
/note="pot. rRNA binding site (ORF 60.1)"
CDS 6207..6416
/note="ORF 60.1 (AA 1-70)"
/codon_start=6207
RBS 6403..6406
/note="pot. rRNA binding site (ORF 59.9)"
CDS 6416..6598
/note="ORF 59.9 (AA 1-61)"
/codon_start=6416
promoter 6586..6591
/note="pot. -10 region (ORF 59.7)"
CDS 6598..6783
/note="ORF 59.7 (AA 1-62)"
/codon_start=6598
promoter 6626..6631
/note="pot. -10 region (tk gene)"
promoter 6760..6765
/note="pot. -10 region (tk gene)"
RBS 6779..6782
/note="pot. rRNA binding site (tk gene)"
CDS 6788..7366
/note="tk gene product (AA 1-193)"
/codon_start=6788
RBS 7402..7406
/note="pot. rRNA binding site (ORF 58.9)"
CDS 7412..7621
/note="ORF 58.9 (AA 1-70)"
/codon_start=7412
RBS 7626..7630
/note="pot. rRNA binding site (ORF 58.6)"
CDS 7637..7927
/note="ORF 58.6 (AA 1-97)"
/codon_start=7637
RBS 7918..7922
/note="pot. rRNA binding site (ORF 58.3)"
CDS 7927..>8008
/note="ORF 58.3 (27AA) (8008 is 1st base in codon)"
/codon_start=7927
BASE COUNT 2666 a 1297 c 1588 g 2457 t
ORIGIN
1 gattccagag atggacgctt ttgctcttat tcctcgtact cagtggcaat atgtgatggg
61 tccttcactt taccgaataa tgaacaacct cttttaattt tataaatacc ttctataaat
121 acttaggagg tattatgaat atatttgaaa tgttacgtat agatgaacgt cttagactta
181 aaatctataa agacacagaa ggctattaca ctattggcat cggtcatttg cttacaaaaa
241 gtccatcact taatgctgct aaatctgaat tagataaagc tattgggcgt aattgcaatg
301 gtgtaattac aaaagatgag gctgaaaaac tctttaatca ggatgttgat gctgctgttc
361 gcggaattct gagaaatgct aaattaaaac cggtttatga ttctcttgat gcggttcgtc
421 gctgtgcatt gattaatatg gttttccaaa tgggagaaac cggtgtggca ggatttacta
481 actctttacg tatgcttcaa caaaaacgct gggatgaagc agcagttaac ttagctaaaa
541 gtatatggta taatcaaaca cctaatcgcg caaaacgagt cattacaacg tttagaactg
601 gcacttggga cgcgtataaa aatctataaa gctgtttact ttctcttgga attgtgatag
661 tatattcaca attacttgaa tagacaatta ctaattaaaa tatttaaagg aaacatatga
721 aaacatatca agaatttatt gccgaagctt ctgtagtaaa ggccaaaggc attaacaaag
781 atgagtggac ctaccgatca ggaaacggct ttgaccctaa aacagctcct attgaacggt
841 acttagctac aaaggcttcc gactttaaag ccttcgcttg ggaaggactt cgctggcgta
901 ccgatttaaa tattgaagtt gacggactta aatttgctca tattgaagat gttgttgcta
961 gtaacttaga ctcagaattt gttaaagctg atgcagacct tcgccgctgg aatttaaaac
1021 tgttctctaa acagaaaggc ccgaagtttg tgcctaaagc cggtaaatgg gtcattgata
1081 ataaattggc taaagctgtc aacttcgcag gtcttgaatt tgccaagcat aaatcatcat
1141 ggaaaggtct tgatgcaatg gctttccgta aagaatttgc cgatgttatg actaaaggcg
1201 gctttaaggc agaaatagat acctctaaag gtaagtttaa agacgctaat attcagtacg
1261 cttacgccgt tgctaatgca gcccgtggta attcttaata aagcttatac ttgggacgct
1321 taaataaaag cagtttacaa ctcctagaat tgtgaatata ttatcacaat tctaggatag
1381 aataataaaa atatttacat ttaaaggaaa catatgaaaa catatcaaga atttattgcc
1441 gaagcgcgag tgggcgcagg taaattagaa gccgctgtaa ataaaaaggc ccattcattt
1501 catgatttgc ccgataaaga ccgtaagaaa cttgtaagcc tttatattga cagagagcgt
1561 attctcgctc ttcctggcgc taatgaaggt aaacaggcca agcctttgaa tgccgtcgaa
1621 aagaaaattg ataactttgc ttctaagttc ggcatgtcta tggatgacct tcagcaagcg
1681 gctatcgaag cagctaaagc aattaaagat aaataacagt ttacatctcc tgtaggtatg
1741 atactataga cctatcaact acaggagaac actaaaatga ctcgtatcaa ccttacttta
1801 gtatctgaat tggctgacca acacttaatg gctgaatatc gtgaattgcc gcgtgttttt
1861 ggtgcagttc gtaagcatgt tgctaacggt aaacgtgttc gtgattttaa aatcagtcct
1921 acttttatcc ttggcgcagg tcatgttaca ttcttttacg ataagctcga gttcttacgt
1981 aaacgtcaaa ttgagcttat agctgaatgt ttaaaacgtg gttttaatat caaggatact
2041 acagtccagg atattagtga tattcctcag gaattccgtg gtgattatat tccccatgaa
2101 gcttctattg ctatatcaca agctcgttta gatgaaaaaa ttgcacaacg tcctacttgg
2161 tacaaatact acggtaaggc gatttatgca taagggaaca acctggacct catgattata
2221 tgagggattc ccgccaacct gtaataaggt cgagcccaag cgcggtaatg ggtaaataca
2281 gaaatggaca attcatgtgc cacggaatgg cccaaactta tagagcttat agagaagaaa
2341 tgagaacatt tttaactggt ccttatctat ccctgatgaa tgcttttaca caccattctg
2401 atgctagagt agaagaaatt tgtaaaaacg aatatatccc gccatttgaa gacttactta
2461 aacagtattg tacacttcga ctagatggtg gacgtcaatc cggtaaatca attgctgtga
2521 ctaactttgc tgctaattgg ttgtatgatg gcggaacagt tattgttctt tctaatactt
2581 cagcttatgc taaaatttct gcaaataaca tcaaaaagga attttcgcgt tattctaatg
2641 atgatatacg ttttcgttta tttactgatt ctgtacgcag ttttattggt aataaaggaa
2701 gcaagttcag aggtttaaag ctttcgcgaa ttttgtatat aattgatgag cctgtcaaat
2761 ctcctgatat ggataagatt tatagtgtcc atattgacac cgtacactac tgctgtaata
2821 gtaaatgttg cattggtggt attactcgtc cacagttttt cgtaatcgga atgcaatgat
2881 gacagacact cagcttttcg aatatcttta tttttcgcca aaaactatta aaaataaatt
2941 ggtgaatcat tttgaaattt tggcaaaaaa taacattttg agcgaatttt atcccaagca
3001 atacaaatta caaaaaggcg tattcaaagg atgcagagtt ttgtgcactg ctcctaatgc
3061 acggctaatg aataaaattc catattttac catggaattt attgatggac cttttaaagg
3121 attaattacg caaagtttaa tggcatatga ttctgagcca tttttaatta aagaacaatc
3181 ttggataaat ttattttcta attgaggttt atatgaaagc atatcaaatt cttgaaggca
3241 cacataaagg tactatttat tttgaagatg gtattcaagc acgaattatt gtctctaaaa
3301 cctttaaaga ggactctttt gtagacccag aaattttcta tggtttgcat gcccgtgaaa
3361 ttgaaattga gccacaacct acagttaaaa ttgaaggtgg tcaacacctg aacgttaacg
3421 ttctgcgtca tgaaactctg gaagatgcag ttaagcatcc ggaaaaatat ccgcagctga
3481 ccatccgtgt atccggttat gcagttcgct ttaactctct gactccggaa cagcagcgcg
3541 acgttatcgc tcgtaccttt actgaaagtt tgtaatggca aagataatta ttgaaggttc
3601 tgaagatgtg ctaaatgctt tcgcgagtgg tttagtaact caggcgaaca gcaatttaat
3661 gaagcgtgga atatgggtga tattgatgga atttatccta cgacagaaat ttctgttcaa
3721 ggctatggca ttcatgaacc tattcgttta gttgaatacg tattatgtac tggtgaggaa
3781 gtcaaatatg attgaagata ttaagggtta taaaccacat actgaagaga aaatcggtaa
3841 agtaaatgct attaaagacg ctgaagttcg tttaggactt atctttgatg ctttatatga
3901 tgaattctgg gaagcactag ataattgcga agactgtgaa ttcgcgaaga attatgctga
3961 aagtctcgat cagttaacta ttgctaaaac gaaactcaaa gaagccagta tgtgggcttg
4021 tcgtgcagtg ttccaaccag aggaaaaata ctaatggctc aattaagcgc agggtttggt
4081 tatgagtatt atactgcccc tcgtcgtgta tctgttgctc ctaagaaaat tcaaagtctt
4141 gatgacttcc aggaagtagt ccgtaacgct ttccaggact atgcacgtta tcttaaagaa
4201 gattcgcagg actgtctcga agaagatgaa attgcttact atacgcagcg tcttgaacag
4261 ctcaaaaatc tacatgaggt tcgtgccgaa gtttcaaagt ctatgaataa attgattaga
4321 tttaaagaat aactgtttac ttttcctctt gactgtggta taatttttct atcagttaag
4381 aggagaataa catgactatc aatacagaag tttttatccg tcgaaataag cttcgtcgtc
4441 actttgagtc ggagtttcgt caaattaaca atgagattcg tgaggcatca aaagcagcag
4501 gagtctcatc gtttcatcta aaatattctc aacatcttct tgatcgcgca attcaacggg
4561 agattgatga gacatacgtt tttgaattat tccataaaat aaaagaccat gttttagaag
4621 ttaatgaatt cctgagtatg cctccgcgtc ctgacattga cgaggatttt attgatgggg
4681 ttgaatatcg tcctggacgt ttagaaatca cagatggaaa tctttggctt ggatttacag
4741 tttgtaaacc taacgagaag ttcaaagacc cgtcacttca atgtaggatg gcaattatca
4801 acagtcgtcg tttaccagga aaggcttcta aagcagtaat taaaactcaa tgaggtaagc
4861 atgagaaaag cactactcgc tggtctattg gccatttcaa tgatggcaca tagctccgag
4921 catactttca gtaatgtcca actcgataac atgcgttacg cgtatcaatt cggggaacaa
4981 ttttctaagg atggaaaata taaaacacac aaaaatatcc acaagagcgg attaggtcat
5041 ataatggctg ccattttatg gcaagaaagc tctggcggag ttaatttaaa atctaaacca
5101 aagcatcacg cctacggaat gttccaaaat tatttgccta ctatgcgagc aagagttaag
5161 gaacttggtt ataatatgac cgatgctgaa ataaaaagaa tgttgaataa acgatccaat
5221 tcagcttcct gggcgtacat tgaactttct tattggttaa atatacataa gggcgatata
5281 agaaaagcaa tatcctctta taattcggga tggaatgtta aagcaggttc taaatatgct
5341 tctgaagtcc tagaaaaggc taattacctt aaaaataata aacttttgga aatagtaaat
5401 gactaaaatt ttggttttat gtataggatt aatttcattt tctgcttctg cgtcagcaga
5461 tacatcatat actgaaatta gagaatatgt aaaccgcact gcggcagatt attgtgggaa
5521 aaataaggca tgccaagctg aatttgcaca gaaattaata tatgcatata aagacggaga
5581 aagagataaa tcaagcagat acaaaaacga tacattgtta aaacgatatg ctaaaaagtg
5641 gaatacctta gaatgttcag ttgcggagga gaaagataaa gccgcttgtc attcaatggt
5701 tgaccgtttg gtagattctt ataatcgagg attgagtact agatgattgt aaaatatatc
5761 aagggcgata ttgtcgccct tttcgctgaa ggtaaaaata ttgcacatgg atgtaattgt
5821 tttcatacta tgggttcagg cgtagcgggt caattaacca aagctttccc taaaattttg
5881 gaagctgata aattacagac tgaatggggt gatgtaacta aactcggttc ttactcagtc
5941 tatgaaaaat actttaggac tcataaagct tactgcttca atctttatac tcaatttcaa
6001 ccagggccaa attttgagta ttccgcttta atgaattgta tgttagaatt aaatgagttt
6061 ggtgaaaata aactgattaa acctacaatc tatatgccta ggattggtgc aggcataggt
6121 aaagggaact gggatattat tgaggggatt ttagatacat attcctctaa attagaaatt
6181 gtgattgttg attgggaacc attattatga atatacatta tccacatcca tatgacccaa
6241 agaataaggc agtaattatt cgtcaatggg aacgcatttg tcgcactaaa tgtccaatta
6301 atagtccaca tgatgtagat aaagactaca ttggaacatt cgttgaatat acctttattg
6361 ataagaaagg tcgtaaacag catgtagaag aatactgctt aaaggtgaca tggttatgag
6421 tttaagcaaa gaacaaaaag acacactctt ttctcttatc cacgaagtta tggataaaaa
6481 tagtgaattg gaaaaagttt gtaatgaatg cggtcctttt agcgcaaacg agtacgaaga
6541 actttctaaa gaattcgata ataaagaaca agaactcatt gattatataa attccttatg
6601 attactcgcg aacaaaagaa cgaaatatta tttttagttg gtgaaattat tagtttagaa
6661 aaggatttgt cttttgaaat atcttctgaa tatggagatg ccgaaacata ttacgaatta
6721 gtaaaatcta tcgataaagc tgaaaatgat ttagaaacat atttagaaaa tttaactaag
6781 gactaagatg gcgagtttaa tttttactta tgcagcaatg aatgctggaa aatctgcttc
6841 tcttttgatt gctgcacata attataaaga acgtggaatg agtgtattag ttcttaagcc
6901 tgctattgat actcgcgatt ctgtctgtga agtcgtttct cgcattggaa ttaagcagga
6961 agcgaatatt attacagatg atatggatat tttcgagttc tataaatggg ctgaagcaca
7021 aaaagatatt cattgcgtat ttgtagatga agctcagttt ttaaaaactg aacaggtgca
7081 tcaattgagc cgaattgttg atacatataa tgttcctgtt atggcttatg ggctaaggac
7141 tgatttcgct ggaaaattat ttgaaggttc taaagaactt ttagcgattg cagataaact
7201 tattgaacta aaagcagttt gtcattgtgg taaaaaagcg attatgacag ctcgattaat
7261 ggaagatgga acaccagtta aagaaggtaa tcaaatttgt attggtgatg aaatttatgt
7321 ttctttgtgt agaaaacatt ggaatgaatt aactaaaaag ctcggttagt gcaaaagtta
7381 taaataggtt tatctaacta aaggggtata tatgctacaa ttaactgaaa agcaacttcg
7441 caatcttact gttcttcaat tagatgaaat tcgtagggaa gttggaaata tcatttcagc
7501 tttgcgtcga gaagtatcac tcaaccaatc tccggcagac tatactagat tgcgaaattt
7561 tgaaaaatac cttgataaag ttaaggccgt gcatcggcac aaagtaaata caggacaaaa
7621 atgataggag gcctttatgg ccttaaaagc aacagcactt tttgccatgc taggattgtc
7681 atttgtttta tctccatcga ttgaagcgaa tgtcgatcct cattttgata aatttatgga
7741 atctggtatt aggcacgttt atatgctttt tgaaaataaa agcgtagaat cgtctgaaca
7801 attctatagt tttatgagaa cgacctataa aaatgacccg tgctcttctg attttgaatg
7861 tatagagcga ggcgcggaga tggcacaatc atacgctaga attatgaaca ttaaattgga
7921 gactgaatga aattcagcga cttttcacaa agtggaaaac cttcaaaggc agatgaatac
7981 ttaggtttat taatggctgc acaagctt
//