GenBank-Updates@genbank.bio.net (05/25/91)
LOCUS       PT4T4G23     1883 bp ds-DNA             PHG       25-MAY-1991
DEFINITION  Bacteriophage T4 gene 23 for major capsid protein
ACCESSION   X01774 J02507
KEYWORDS    capsid protein; inverted repeat; overlapping genes.
SOURCE      Bacteriophage T4 DNA.
  ORGANISM  Bacteriophage T4
            Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE   1  (sites)
  AUTHORS   Christensen,A.C. and Young,E.T.
  TITLE     T4 late transcripts are initiated near a conserved DNA sequence
  JOURNAL   Nature 299, 369-371 (1982)
  STANDARD  full staff_review
REFERENCE   2  (sites)
  AUTHORS   Elliott,T. and Geiduschek,E.P.
  JOURNAL   Cell 36, 211-219 (1984)
  STANDARD  full staff_review
REFERENCE   3  (bases 1 to 1883)
  AUTHORS   Parker,M.L., Christensen,A.C., Boosman,A., Stockard,J.,
            Young,E.T. and Doermann,A.H.
  TITLE     Nucleotide sequence of bacteriophage T4 gene 23 and the amino
            acid sequence of its product
  JOURNAL   J. Mol. Biol. 180, 399-416 (1984)
  STANDARD  full automatic
COMMENT     SWISS-PROT; P04535; COAT$BPT4.
            From EMBL entry MYT4G23; dated 22-APR-1990.
FEATURES             Location/Qualifiers
     CDS             <1..159
                     /note="gp22 (gene 22)"
                     /codon_start=1
     promoter        39..42
                     /note="put. promoter (gene 23)"
     RBS             169..172
                     /note="pot. ribosome binding site"
     CDS             181..1743
                     /note="gp23: major capsid protein (gene 23) (aa 1-521)"
                     /codon_start=181
     mutation        313..313
                     /note="C is T in am H11"
     mutation        427..427
                     /note="C is T in am C140"
     mutation        448..448
                     /note="C is T in am B17"
     mutation        649..649
                     /note="C is T in am B272"
     mutation        727..727
                     /note="C is T in am H32 and am E509"
     mutation        886..886
                     /note="C is T in am E506"
     mutation        920..920
                     /note="G is A in am E161"
     mutation        1106..1106
                     /note="G is A in am B278 and am E1270"
     mutation        1123..1123
                     /note="C is T in am C137"
     mutation        1183..1183
                     /note="C is T in am H36 and am E389"
     mutation        1246..1246
                     /note="C is T in am E507"
     mutation        1372..1372
                     /note="C is T in am A489"
     mutation        1459..1459
                     /note="C is T in am C208"
     mutation        1648..1648
                     /note="C is T in am E757"
     mutation        1669..1669
                     /note="C is T in am E1236"
     misc_signal     1772..1791
                     /note="pot. transcription terminator (gene 21, 22, 23)"
     repeat_unit     1772..1779
                     /note="inverted repeat A"
     repeat_unit     1784..1791
                     /note="inverted repeat A'"
BASE COUNT      527 a    394 c    429 g    533 t
ORIGIN
        1 aagaaatcta ataaagatga aagcactatt actgagagta taaatactcc tgatactgaa
       61 gcagccggac tgaatttcgt cactgaagct gtagaagata aagctgcaca gggtgcagaa
      121 gatattgtaa gtgtatatgc gaaagtcgca tctcgtttct aattttaaag gttaacacaa
      181 atgactatca aaactaaagc tgaacttttg aacaaatgga agccattact ggaaggtgaa
      241 ggtttaccgg aaattgctaa tagcaaacaa gcgattatcg ctaaaatctt tgaaaaccag
      301 gaaaaagatt tccagacagc tccggaatat aaagacgaaa aaattgctca ggcattcggt
      361 tctttcttaa cagaagctga aatcggtggt gaccacggtt acaatgctac caacatcgct
      421 gcaggtcaga cttctggcgc agtaactcag attggcccag ctgttatggg tatggtacgt
      481 cgtgctattc ctaacctgat tgctttcgat atttgtggtg ttcagccgat gaacagcccg
      541 actggccagg tattcgcact gcgcgcagta tatggtaaag acccagtggc tgccggtgct
      601 aaagaagcat tccacccaat gtatggtcca gatgcaatgt tctctggtca gggtgctgct
      661 aagaaattcc cagctctggc tgctagcaca caaaccacag taggtgatat ctatactcac
      721 ttcttccagg aaactggtac tgtatatctg caagcttctg ttcaagtaac aatcgatgct
      781 ggtgcgactg atgctgctaa attagatgct gaaattaaga aacaaatgga agctggtgca
      841 ctggtagaaa tcgctgaagg tatggctact tctatcgctg aactccagga aggtttcaat
      901 ggttctaccg ataacccatg gaatgaaatg ggcttccgta tcgataagca agttatcgaa
      961 gctaaatctc gtcagctgaa agctgcttac tctattgaat tagcacaaga cctccgcgct
     1021 gttcacggta tggatgctga tgctgaactg tctggtattc tggctacaga aattatgctg
     1081 gaaatcaacc gtgaagttgt tgattggatt aactactcag ctcaggttgg taaatctggt
     1141 atgaccctga ctccgggttc taaagctggt gtatttgact tccaggaccc aattgatatt
     1201 cgtggtgctc gctgggcggg tgaatccttt aaagctctgt tgttccagat tgacaaagaa
     1261 gcagttgaaa ttgctcgtca gaccggtcgt ggtgaaggta acttcattat cgcttcccgt
     1321 aacgtagtta acgttttggc ttcagttgat accggcattt cttatgctgc acagggtctg
     1381 gctaccggct ttagcactga tactaccaag tcagtatttg ctggtgttct gggtggtaaa
     1441 taccgcgtat atatcgacca gtatgctaaa caggattatt tcactgtagg ttataaaggt
     1501 ccgaacgaaa tggatgctgg tatttactat gctccatatg tagctctgac tccgctgcgt
     1561 ggttccgatc cgaagaactt ccaaccggta atgggattca aaactcgtta cggtatcggt
     1621 atcaacccat ttgcagaatc cgctgctcag gctccggctt ctcgcatcca gagcggtatg
     1681 ccttctattc tgaatagcct tggtaaaaac gcttacttta gacgtgtata tgttaaaggt
     1741 atctaatctc taacgataga aacacaattt tagggaacct tcgggttccc tttttctatt
     1801 ttatacgata gcaatcaggc atatcatccg catttatcca attgcgaata gttttaggac
     1861 taactttaaa atcgtccgct gcg
//