GenBank-Updates@genbank.bio.net (05/25/91)
LOCUS PT4T4G23 1883 bp ds-DNA PHG 25-MAY-1991
DEFINITION Bacteriophage T4 gene 23 for major capsid protein
ACCESSION X01774 J02507
KEYWORDS capsid protein; inverted repeat; overlapping genes.
SOURCE Bacteriophage T4 DNA.
ORGANISM Bacteriophage T4
Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE 1 (sites)
AUTHORS Christensen,A.C. and Young,E.T.
TITLE T4 late transcripts are initiated near a conserved DNA sequence
JOURNAL Nature 299, 369-371 (1982)
STANDARD full staff_review
REFERENCE 2 (sites)
AUTHORS Elliott,T. and Geiduschek,E.P.
JOURNAL Cell 36, 211-219 (1984)
STANDARD full staff_review
REFERENCE 3 (bases 1 to 1883)
AUTHORS Parker,M.L., Christensen,A.C., Boosman,A., Stockard,J., Young,E.T.
and Doermann,A.H.
TITLE Nucleotide sequence of bacteriophage T4 gene 23 and the amino acid
sequence of its product
JOURNAL J. Mol. Biol. 180, 399-416 (1984)
STANDARD full automatic
COMMENT SWISS-PROT; P04535; COAT$BPT4.
From EMBL entry MYT4G23; dated 22-APR-1990.
FEATURES Location/Qualifiers
CDS <1..159
/note="gp22 (gene 22)"
/codon_start=1
promoter 39..42
/note="put. promoter (gene 23)"
RBS 169..172
/note="pot. ribosome binding site"
CDS 181..1743
/note="gp23: major capsid protein (gene 23) (aa 1-521)"
/codon_start=181
mutation 313..313
/note="C is T in am H11"
mutation 427..427
/note="C is T in am C140"
mutation 448..448
/note="C is T in am B17"
mutation 649..649
/note="C is T in am B272"
mutation 727..727
/note="C is T in am H32 and am E509"
mutation 886..886
/note="C is T in am E506"
mutation 920..920
/note="G is A in am E161"
mutation 1106..1106
/note="G is A in am B278 and am E1270"
mutation 1123..1123
/note="C is T in am C137"
mutation 1183..1183
/note="C is T in am H36 and am E389"
mutation 1246..1246
/note="C is T in am E507"
mutation 1372..1372
/note="C is T in am A489"
mutation 1459..1459
/note="C is T in am C208"
mutation 1648..1648
/note="C is T in am E757"
mutation 1669..1669
/note="C is T in am E1236"
misc_signal 1772..1791
/note="pot. transcription terminator (gene 21, 22, 23)"
repeat_unit 1772..1779
/note="inverted repeat A"
repeat_unit 1784..1791
/note="inverted repeat A'"
BASE COUNT 527 a 394 c 429 g 533 t
ORIGIN
1 aagaaatcta ataaagatga aagcactatt actgagagta taaatactcc tgatactgaa
61 gcagccggac tgaatttcgt cactgaagct gtagaagata aagctgcaca gggtgcagaa
121 gatattgtaa gtgtatatgc gaaagtcgca tctcgtttct aattttaaag gttaacacaa
181 atgactatca aaactaaagc tgaacttttg aacaaatgga agccattact ggaaggtgaa
241 ggtttaccgg aaattgctaa tagcaaacaa gcgattatcg ctaaaatctt tgaaaaccag
301 gaaaaagatt tccagacagc tccggaatat aaagacgaaa aaattgctca ggcattcggt
361 tctttcttaa cagaagctga aatcggtggt gaccacggtt acaatgctac caacatcgct
421 gcaggtcaga cttctggcgc agtaactcag attggcccag ctgttatggg tatggtacgt
481 cgtgctattc ctaacctgat tgctttcgat atttgtggtg ttcagccgat gaacagcccg
541 actggccagg tattcgcact gcgcgcagta tatggtaaag acccagtggc tgccggtgct
601 aaagaagcat tccacccaat gtatggtcca gatgcaatgt tctctggtca gggtgctgct
661 aagaaattcc cagctctggc tgctagcaca caaaccacag taggtgatat ctatactcac
721 ttcttccagg aaactggtac tgtatatctg caagcttctg ttcaagtaac aatcgatgct
781 ggtgcgactg atgctgctaa attagatgct gaaattaaga aacaaatgga agctggtgca
841 ctggtagaaa tcgctgaagg tatggctact tctatcgctg aactccagga aggtttcaat
901 ggttctaccg ataacccatg gaatgaaatg ggcttccgta tcgataagca agttatcgaa
961 gctaaatctc gtcagctgaa agctgcttac tctattgaat tagcacaaga cctccgcgct
1021 gttcacggta tggatgctga tgctgaactg tctggtattc tggctacaga aattatgctg
1081 gaaatcaacc gtgaagttgt tgattggatt aactactcag ctcaggttgg taaatctggt
1141 atgaccctga ctccgggttc taaagctggt gtatttgact tccaggaccc aattgatatt
1201 cgtggtgctc gctgggcggg tgaatccttt aaagctctgt tgttccagat tgacaaagaa
1261 gcagttgaaa ttgctcgtca gaccggtcgt ggtgaaggta acttcattat cgcttcccgt
1321 aacgtagtta acgttttggc ttcagttgat accggcattt cttatgctgc acagggtctg
1381 gctaccggct ttagcactga tactaccaag tcagtatttg ctggtgttct gggtggtaaa
1441 taccgcgtat atatcgacca gtatgctaaa caggattatt tcactgtagg ttataaaggt
1501 ccgaacgaaa tggatgctgg tatttactat gctccatatg tagctctgac tccgctgcgt
1561 ggttccgatc cgaagaactt ccaaccggta atgggattca aaactcgtta cggtatcggt
1621 atcaacccat ttgcagaatc cgctgctcag gctccggctt ctcgcatcca gagcggtatg
1681 ccttctattc tgaatagcct tggtaaaaac gcttacttta gacgtgtata tgttaaaggt
1741 atctaatctc taacgataga aacacaattt tagggaacct tcgggttccc tttttctatt
1801 ttatacgata gcaatcaggc atatcatccg catttatcca attgcgaata gttttaggac
1861 taactttaaa atcgtccgct gcg
//