GenBank-Updates@genbank.bio.net (05/25/91)
LOCUS PT4T4ER1 8008 bp ds-DNA PHG 25-MAY-1991 DEFINITION Bacteriophage T4 DNA for 58.3 to 65.5 kb early region ACCESSION X04567 V00860 X01124 KEYWORDS denV gene; e gene; endonuclease V; inverted repeat; overlapping genes; signal peptide; tk gene; unidentified reading frame. SOURCE Bacteriophage T4 DNA. ORGANISM Bacteriophage T4 Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 750) AUTHORS Owen,J.E., Schultz,D.W., Taylor,A. and Smith,G.R. TITLE Nucleotide sequence of the lysozyme gene of bacteriophage T4. Analysis of mutations involving repeated sequences JOURNAL J. Mol. Biol. 165, 229-248 (1983) STANDARD full automatic REFERENCE 2 (bases 1685 to 2253) AUTHORS Valerie,K., Henderson,E.E. and DeRiel,J.K. TITLE Identification, physical map location and sequence of the denV gene from bacteriophage T4 JOURNAL Nucleic Acids Res. 12, 8085-8096 (1984) STANDARD full automatic REFERENCE 3 (bases 1 to 8008) AUTHORS Valerie,K., Stevens,J., Lynch,M., Henderson,E.E. and de,R.J.K. TITLE Nucleotide sequence and analysis of the 58.3 to 65.5-Kb early region of bacteriophage T4 JOURNAL Nucleic Acids Res. 14, 8637-8654 (1986) STANDARD full automatic COMMENT SWISS-PROT; P00720; LYCV$BPT4. SWISS-PROT; P03719; VIN2$BPT4. SWISS-PROT; P04418; END5$BPT4. SWISS-PROT; P13300; KITH$BPT4. SWISS-PROT; P13302; VIN3$BPT4. SWISS-PROT; P13303; Y583$BPT4. SWISS-PROT; P13304; Y586$BPT4. SWISS-PROT; P13305; Y589$BPT4. SWISS-PROT; P13306; Y597$BPT4. SWISS-PROT; P13307; Y599$BPT4. SWISS-PROT; P13308; Y601$BPT4. SWISS-PROT; P13309; Y605$BPT4. SWISS-PROT; P13310; Y609$BPT4. SWISS-PROT; P13311; Y614$BPT4. SWISS-PROT; P13312; REGB$BPT4. SWISS-PROT; P13313; Y622$BPT4. SWISS-PROT; P13314; Y625$BPT4. SWISS-PROT; P13315; Y627$BPT4. SWISS-PROT; P13316; Y631$BPT4. SWISS-PROT; P13317; Y634$BPT4. SWISS-PROT; P13318; Y640$BPT4. From EMBL entry MYT4ER1; dated 19-SEP-1987. FEATURES Location/Qualifiers RBS 125..128 /note="pot. rRNA binding site (gene e)" CDS 135..626 /note="gene e product (AA 1-184)" /codon_start=135 conflict 544..544 /note="t is g in [2]" /citation=[1] promoter 549..554 /note="pot. -10 region (IpIII)" promoter 615..620 /note="pot. -10 region (IpIII)" promoter 625..630 /note="pot. -10 region (IpIII)" terminator 650..673 /note="put. stem-loop structure; pot. transcription terminator (gene e)" repeat_unit 650..657 /note="inverted repeat A" promoter 661..666 /note="pot. -10 region (IpIII)" repeat_unit 666..673 /note="inverted repeat A'" promoter 689..694 /note="pot. -10 region (IpIII)" RBS 708..711 /note="pot. rRNA binding site (IpIII)" CDS 717..1295 /note="precursor polypeptide (AA -10 to 183)" /codon_start=717 CDS 717..746 /note="signal peptide (AA -10 to -1)" /codon_start=717 CDS 747..1295 /note="mature internal protein IpIII (AA 1-183)" /codon_start=747 promoter 1296..1301 /note="pot. -10 region (IpII)" terminator 1342..1377 /note="put. stem-loop structure; pot. transcription terminator (IpIII)" repeat_unit 1342..1355 /note="inverted repeat B" promoter 1357..1362 /note="pot. -10 region (IpII)" repeat_unit 1364..1377 /note="inverted repeat B'" RBS 1405..1408 /note="pot. rRNA binding site (IpII)" CDS 1414..1713 /note="precursor polypeptide (AA -10 to 90)" /codon_start=1414 CDS 1414..1443 /note="signal peptide (AA -10 to -1)" /codon_start=1414 CDS 1444..1713 /note="mature internal protein IpII (AA 1-90)" /codon_start=1444 terminator 1726..1768 /note="put. stem-loop structure; pot. transcription terminator (IpII)" repeat_unit 1726..1743 /note="imp. inverted repeat C" promoter 1745..1750 /note="pot. -10 region (denV gene)" repeat_unit 1753..1768 /note="imp. inverted repeat C'" RBS 1763..1768 /note="pot. rRNA binding site (denV gene)" CDS 1777..2190 /note="endonuclease V (AA 1-138)" /codon_start=1777 RBS 2212..2215 /note="pot. rRNA binding site (ORF 64.0)" CDS 2220..2876 /note="ORF 64.0 (AA 1-219)" /codon_start=2220 RBS 2259..2262 /note="pot. 1st alternative rRNA binding site (ORF 64.0(1))" CDS 2268..2876 /note="alternative ORF 64.0(1) (AA 1-203)" /codon_start=2268 RBS 2285..2288 /note="alternative pot. rRNA binding site (ORF 64.0(2))" CDS 2295..2876 /note="alternative ORF 64.0(3) (AA 1-190)" /codon_start=2295 RBS 2331..2335 /note="alternative pot. rRNA binding site (ORF 64.0(4))" CDS 2340..2876 /note="alternaitve ORF 64.0(4) (AA 1-179)" /codon_start=2340 CDS 2376..2876 /note="alternative ORF 64.0(5) (AA 1-167)" /codon_start=2376 promoter 2717..2722 /note="pot. -35 region (ORF 63.4)" promoter 2736..2741 /note="pot. -10 region (ORF 63.4)" CDS 2825..3202 /note="pot. alternative ORF 63.4 (AA 1-126)" /codon_start=2825 RBS 2867..2870 /note="pot. rRNA binding site (ORF63.4)" CDS 2876..3202 /note="ORF 63.4 (AA 1-109)" /codon_start=2876 CDS 3213..3572 /note="ORF 63.1 (AA 1-120)" /codon_start=3213 RBS 3565..3568 /note="pot. rRNA binding site (ORF 62.7)" CDS 3575..3748 /note="ORF 62.7 (AA 1-58)" /codon_start=3575 RBS 3776..3781 /note="pot. rRNA binding site (ORF 62.5)" CDS 3788..4051 /note="ORF 62.5 (AA 1-88)" /codon_start=3788 RBS 4041..4044 /note="pot. rRNA binding site (ORF 62.2)" CDS 4054..4329 /note="ORF 62.2 (AA 1-92)" /codon_start=4054 terminator 4334..4392 /note="put. stem-loop structure; pot. transcription terminator (ORF 62.2)" repeat_unit 4334..4355 /note="imp. inverted repeat D" promoter 4359..4364 /note="pot. -10 region (ORF 61.9)" repeat_unit 4373..4392 /note="imp. inverted repeat D'" RBS 4381..4386 /note="pot. rRNA binding site (ORF 61.9)" CDS 4392..4850 /note="ORF 61.9 (AA 1-153)" /codon_start=4392 promoter 4821..4827 /note="pot. -35 region (ORF 61.4)" promoter 4837..4842 /note="pot. -10 region (ORF 61.4)" RBS 4852..4855 /note="pot. rRNA binding site (ORF 61.4)" CDS 4861..5403 /note="ORF 61.4 (AA 1-181)" /codon_start=4861 RBS 5387..5390 /note="pot. rRNA binding site (ORF 60.9)" CDS 5399..5743 /note="ORF 60.9 (AA 1-115)" /codon_start=5399 promoter 5703..5708 /note="pot. -35 region (ORF 60.5)" promoter 5720..5725 /note="pot. -10 region (ORF 60.5)" RBS 5728..5731 /note="pot. rRNA binding site (ORF 60.5)" CDS 5743..6207 /note="ORF 60.5 (AA 1-155)" /codon_start=5743 RBS 6194..6197 /note="pot. rRNA binding site (ORF 60.1)" CDS 6207..6416 /note="ORF 60.1 (AA 1-70)" /codon_start=6207 RBS 6403..6406 /note="pot. rRNA binding site (ORF 59.9)" CDS 6416..6598 /note="ORF 59.9 (AA 1-61)" /codon_start=6416 promoter 6586..6591 /note="pot. -10 region (ORF 59.7)" CDS 6598..6783 /note="ORF 59.7 (AA 1-62)" /codon_start=6598 promoter 6626..6631 /note="pot. -10 region (tk gene)" promoter 6760..6765 /note="pot. -10 region (tk gene)" RBS 6779..6782 /note="pot. rRNA binding site (tk gene)" CDS 6788..7366 /note="tk gene product (AA 1-193)" /codon_start=6788 RBS 7402..7406 /note="pot. rRNA binding site (ORF 58.9)" CDS 7412..7621 /note="ORF 58.9 (AA 1-70)" /codon_start=7412 RBS 7626..7630 /note="pot. rRNA binding site (ORF 58.6)" CDS 7637..7927 /note="ORF 58.6 (AA 1-97)" /codon_start=7637 RBS 7918..7922 /note="pot. rRNA binding site (ORF 58.3)" CDS 7927..>8008 /note="ORF 58.3 (27AA) (8008 is 1st base in codon)" /codon_start=7927 BASE COUNT 2666 a 1297 c 1588 g 2457 t ORIGIN 1 gattccagag atggacgctt ttgctcttat tcctcgtact cagtggcaat atgtgatggg 61 tccttcactt taccgaataa tgaacaacct cttttaattt tataaatacc ttctataaat 121 acttaggagg tattatgaat atatttgaaa tgttacgtat agatgaacgt cttagactta 181 aaatctataa agacacagaa ggctattaca ctattggcat cggtcatttg cttacaaaaa 241 gtccatcact taatgctgct aaatctgaat tagataaagc tattgggcgt aattgcaatg 301 gtgtaattac aaaagatgag gctgaaaaac tctttaatca ggatgttgat gctgctgttc 361 gcggaattct gagaaatgct aaattaaaac cggtttatga ttctcttgat gcggttcgtc 421 gctgtgcatt gattaatatg gttttccaaa tgggagaaac cggtgtggca ggatttacta 481 actctttacg tatgcttcaa caaaaacgct gggatgaagc agcagttaac ttagctaaaa 541 gtatatggta taatcaaaca cctaatcgcg caaaacgagt cattacaacg tttagaactg 601 gcacttggga cgcgtataaa aatctataaa gctgtttact ttctcttgga attgtgatag 661 tatattcaca attacttgaa tagacaatta ctaattaaaa tatttaaagg aaacatatga 721 aaacatatca agaatttatt gccgaagctt ctgtagtaaa ggccaaaggc attaacaaag 781 atgagtggac ctaccgatca ggaaacggct ttgaccctaa aacagctcct attgaacggt 841 acttagctac aaaggcttcc gactttaaag ccttcgcttg ggaaggactt cgctggcgta 901 ccgatttaaa tattgaagtt gacggactta aatttgctca tattgaagat gttgttgcta 961 gtaacttaga ctcagaattt gttaaagctg atgcagacct tcgccgctgg aatttaaaac 1021 tgttctctaa acagaaaggc ccgaagtttg tgcctaaagc cggtaaatgg gtcattgata 1081 ataaattggc taaagctgtc aacttcgcag gtcttgaatt tgccaagcat aaatcatcat 1141 ggaaaggtct tgatgcaatg gctttccgta aagaatttgc cgatgttatg actaaaggcg 1201 gctttaaggc agaaatagat acctctaaag gtaagtttaa agacgctaat attcagtacg 1261 cttacgccgt tgctaatgca gcccgtggta attcttaata aagcttatac ttgggacgct 1321 taaataaaag cagtttacaa ctcctagaat tgtgaatata ttatcacaat tctaggatag 1381 aataataaaa atatttacat ttaaaggaaa catatgaaaa catatcaaga atttattgcc 1441 gaagcgcgag tgggcgcagg taaattagaa gccgctgtaa ataaaaaggc ccattcattt 1501 catgatttgc ccgataaaga ccgtaagaaa cttgtaagcc tttatattga cagagagcgt 1561 attctcgctc ttcctggcgc taatgaaggt aaacaggcca agcctttgaa tgccgtcgaa 1621 aagaaaattg ataactttgc ttctaagttc ggcatgtcta tggatgacct tcagcaagcg 1681 gctatcgaag cagctaaagc aattaaagat aaataacagt ttacatctcc tgtaggtatg 1741 atactataga cctatcaact acaggagaac actaaaatga ctcgtatcaa ccttacttta 1801 gtatctgaat tggctgacca acacttaatg gctgaatatc gtgaattgcc gcgtgttttt 1861 ggtgcagttc gtaagcatgt tgctaacggt aaacgtgttc gtgattttaa aatcagtcct 1921 acttttatcc ttggcgcagg tcatgttaca ttcttttacg ataagctcga gttcttacgt 1981 aaacgtcaaa ttgagcttat agctgaatgt ttaaaacgtg gttttaatat caaggatact 2041 acagtccagg atattagtga tattcctcag gaattccgtg gtgattatat tccccatgaa 2101 gcttctattg ctatatcaca agctcgttta gatgaaaaaa ttgcacaacg tcctacttgg 2161 tacaaatact acggtaaggc gatttatgca taagggaaca acctggacct catgattata 2221 tgagggattc ccgccaacct gtaataaggt cgagcccaag cgcggtaatg ggtaaataca 2281 gaaatggaca attcatgtgc cacggaatgg cccaaactta tagagcttat agagaagaaa 2341 tgagaacatt tttaactggt ccttatctat ccctgatgaa tgcttttaca caccattctg 2401 atgctagagt agaagaaatt tgtaaaaacg aatatatccc gccatttgaa gacttactta 2461 aacagtattg tacacttcga ctagatggtg gacgtcaatc cggtaaatca attgctgtga 2521 ctaactttgc tgctaattgg ttgtatgatg gcggaacagt tattgttctt tctaatactt 2581 cagcttatgc taaaatttct gcaaataaca tcaaaaagga attttcgcgt tattctaatg 2641 atgatatacg ttttcgttta tttactgatt ctgtacgcag ttttattggt aataaaggaa 2701 gcaagttcag aggtttaaag ctttcgcgaa ttttgtatat aattgatgag cctgtcaaat 2761 ctcctgatat ggataagatt tatagtgtcc atattgacac cgtacactac tgctgtaata 2821 gtaaatgttg cattggtggt attactcgtc cacagttttt cgtaatcgga atgcaatgat 2881 gacagacact cagcttttcg aatatcttta tttttcgcca aaaactatta aaaataaatt 2941 ggtgaatcat tttgaaattt tggcaaaaaa taacattttg agcgaatttt atcccaagca 3001 atacaaatta caaaaaggcg tattcaaagg atgcagagtt ttgtgcactg ctcctaatgc 3061 acggctaatg aataaaattc catattttac catggaattt attgatggac cttttaaagg 3121 attaattacg caaagtttaa tggcatatga ttctgagcca tttttaatta aagaacaatc 3181 ttggataaat ttattttcta attgaggttt atatgaaagc atatcaaatt cttgaaggca 3241 cacataaagg tactatttat tttgaagatg gtattcaagc acgaattatt gtctctaaaa 3301 cctttaaaga ggactctttt gtagacccag aaattttcta tggtttgcat gcccgtgaaa 3361 ttgaaattga gccacaacct acagttaaaa ttgaaggtgg tcaacacctg aacgttaacg 3421 ttctgcgtca tgaaactctg gaagatgcag ttaagcatcc ggaaaaatat ccgcagctga 3481 ccatccgtgt atccggttat gcagttcgct ttaactctct gactccggaa cagcagcgcg 3541 acgttatcgc tcgtaccttt actgaaagtt tgtaatggca aagataatta ttgaaggttc 3601 tgaagatgtg ctaaatgctt tcgcgagtgg tttagtaact caggcgaaca gcaatttaat 3661 gaagcgtgga atatgggtga tattgatgga atttatccta cgacagaaat ttctgttcaa 3721 ggctatggca ttcatgaacc tattcgttta gttgaatacg tattatgtac tggtgaggaa 3781 gtcaaatatg attgaagata ttaagggtta taaaccacat actgaagaga aaatcggtaa 3841 agtaaatgct attaaagacg ctgaagttcg tttaggactt atctttgatg ctttatatga 3901 tgaattctgg gaagcactag ataattgcga agactgtgaa ttcgcgaaga attatgctga 3961 aagtctcgat cagttaacta ttgctaaaac gaaactcaaa gaagccagta tgtgggcttg 4021 tcgtgcagtg ttccaaccag aggaaaaata ctaatggctc aattaagcgc agggtttggt 4081 tatgagtatt atactgcccc tcgtcgtgta tctgttgctc ctaagaaaat tcaaagtctt 4141 gatgacttcc aggaagtagt ccgtaacgct ttccaggact atgcacgtta tcttaaagaa 4201 gattcgcagg actgtctcga agaagatgaa attgcttact atacgcagcg tcttgaacag 4261 ctcaaaaatc tacatgaggt tcgtgccgaa gtttcaaagt ctatgaataa attgattaga 4321 tttaaagaat aactgtttac ttttcctctt gactgtggta taatttttct atcagttaag 4381 aggagaataa catgactatc aatacagaag tttttatccg tcgaaataag cttcgtcgtc 4441 actttgagtc ggagtttcgt caaattaaca atgagattcg tgaggcatca aaagcagcag 4501 gagtctcatc gtttcatcta aaatattctc aacatcttct tgatcgcgca attcaacggg 4561 agattgatga gacatacgtt tttgaattat tccataaaat aaaagaccat gttttagaag 4621 ttaatgaatt cctgagtatg cctccgcgtc ctgacattga cgaggatttt attgatgggg 4681 ttgaatatcg tcctggacgt ttagaaatca cagatggaaa tctttggctt ggatttacag 4741 tttgtaaacc taacgagaag ttcaaagacc cgtcacttca atgtaggatg gcaattatca 4801 acagtcgtcg tttaccagga aaggcttcta aagcagtaat taaaactcaa tgaggtaagc 4861 atgagaaaag cactactcgc tggtctattg gccatttcaa tgatggcaca tagctccgag 4921 catactttca gtaatgtcca actcgataac atgcgttacg cgtatcaatt cggggaacaa 4981 ttttctaagg atggaaaata taaaacacac aaaaatatcc acaagagcgg attaggtcat 5041 ataatggctg ccattttatg gcaagaaagc tctggcggag ttaatttaaa atctaaacca 5101 aagcatcacg cctacggaat gttccaaaat tatttgccta ctatgcgagc aagagttaag 5161 gaacttggtt ataatatgac cgatgctgaa ataaaaagaa tgttgaataa acgatccaat 5221 tcagcttcct gggcgtacat tgaactttct tattggttaa atatacataa gggcgatata 5281 agaaaagcaa tatcctctta taattcggga tggaatgtta aagcaggttc taaatatgct 5341 tctgaagtcc tagaaaaggc taattacctt aaaaataata aacttttgga aatagtaaat 5401 gactaaaatt ttggttttat gtataggatt aatttcattt tctgcttctg cgtcagcaga 5461 tacatcatat actgaaatta gagaatatgt aaaccgcact gcggcagatt attgtgggaa 5521 aaataaggca tgccaagctg aatttgcaca gaaattaata tatgcatata aagacggaga 5581 aagagataaa tcaagcagat acaaaaacga tacattgtta aaacgatatg ctaaaaagtg 5641 gaatacctta gaatgttcag ttgcggagga gaaagataaa gccgcttgtc attcaatggt 5701 tgaccgtttg gtagattctt ataatcgagg attgagtact agatgattgt aaaatatatc 5761 aagggcgata ttgtcgccct tttcgctgaa ggtaaaaata ttgcacatgg atgtaattgt 5821 tttcatacta tgggttcagg cgtagcgggt caattaacca aagctttccc taaaattttg 5881 gaagctgata aattacagac tgaatggggt gatgtaacta aactcggttc ttactcagtc 5941 tatgaaaaat actttaggac tcataaagct tactgcttca atctttatac tcaatttcaa 6001 ccagggccaa attttgagta ttccgcttta atgaattgta tgttagaatt aaatgagttt 6061 ggtgaaaata aactgattaa acctacaatc tatatgccta ggattggtgc aggcataggt 6121 aaagggaact gggatattat tgaggggatt ttagatacat attcctctaa attagaaatt 6181 gtgattgttg attgggaacc attattatga atatacatta tccacatcca tatgacccaa 6241 agaataaggc agtaattatt cgtcaatggg aacgcatttg tcgcactaaa tgtccaatta 6301 atagtccaca tgatgtagat aaagactaca ttggaacatt cgttgaatat acctttattg 6361 ataagaaagg tcgtaaacag catgtagaag aatactgctt aaaggtgaca tggttatgag 6421 tttaagcaaa gaacaaaaag acacactctt ttctcttatc cacgaagtta tggataaaaa 6481 tagtgaattg gaaaaagttt gtaatgaatg cggtcctttt agcgcaaacg agtacgaaga 6541 actttctaaa gaattcgata ataaagaaca agaactcatt gattatataa attccttatg 6601 attactcgcg aacaaaagaa cgaaatatta tttttagttg gtgaaattat tagtttagaa 6661 aaggatttgt cttttgaaat atcttctgaa tatggagatg ccgaaacata ttacgaatta 6721 gtaaaatcta tcgataaagc tgaaaatgat ttagaaacat atttagaaaa tttaactaag 6781 gactaagatg gcgagtttaa tttttactta tgcagcaatg aatgctggaa aatctgcttc 6841 tcttttgatt gctgcacata attataaaga acgtggaatg agtgtattag ttcttaagcc 6901 tgctattgat actcgcgatt ctgtctgtga agtcgtttct cgcattggaa ttaagcagga 6961 agcgaatatt attacagatg atatggatat tttcgagttc tataaatggg ctgaagcaca 7021 aaaagatatt cattgcgtat ttgtagatga agctcagttt ttaaaaactg aacaggtgca 7081 tcaattgagc cgaattgttg atacatataa tgttcctgtt atggcttatg ggctaaggac 7141 tgatttcgct ggaaaattat ttgaaggttc taaagaactt ttagcgattg cagataaact 7201 tattgaacta aaagcagttt gtcattgtgg taaaaaagcg attatgacag ctcgattaat 7261 ggaagatgga acaccagtta aagaaggtaa tcaaatttgt attggtgatg aaatttatgt 7321 ttctttgtgt agaaaacatt ggaatgaatt aactaaaaag ctcggttagt gcaaaagtta 7381 taaataggtt tatctaacta aaggggtata tatgctacaa ttaactgaaa agcaacttcg 7441 caatcttact gttcttcaat tagatgaaat tcgtagggaa gttggaaata tcatttcagc 7501 tttgcgtcga gaagtatcac tcaaccaatc tccggcagac tatactagat tgcgaaattt 7561 tgaaaaatac cttgataaag ttaaggccgt gcatcggcac aaagtaaata caggacaaaa 7621 atgataggag gcctttatgg ccttaaaagc aacagcactt tttgccatgc taggattgtc 7681 atttgtttta tctccatcga ttgaagcgaa tgtcgatcct cattttgata aatttatgga 7741 atctggtatt aggcacgttt atatgctttt tgaaaataaa agcgtagaat cgtctgaaca 7801 attctatagt tttatgagaa cgacctataa aaatgacccg tgctcttctg attttgaatg 7861 tatagagcga ggcgcggaga tggcacaatc atacgctaga attatgaaca ttaaattgga 7921 gactgaatga aattcagcga cttttcacaa agtggaaaac cttcaaaggc agatgaatac 7981 ttaggtttat taatggctgc acaagctt //