[bionet.molbio.genbank.updates] Bacteriophage T4 gene 23 for major capsid protein

GenBank-Updates@genbank.bio.net (05/25/91)

LOCUS       PT4T4G23     1883 bp ds-DNA             PHG       25-MAY-1991
DEFINITION  Bacteriophage T4 gene 23 for major capsid protein
ACCESSION   X01774 J02507
KEYWORDS    capsid protein; inverted repeat; overlapping genes.
SOURCE      Bacteriophage T4 DNA.
  ORGANISM  Bacteriophage T4
            Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE   1  (sites)
  AUTHORS   Christensen,A.C. and Young,E.T.
  TITLE     T4 late transcripts are initiated near a conserved DNA sequence
  JOURNAL   Nature 299, 369-371 (1982)
  STANDARD  full staff_review
REFERENCE   2  (sites)
  AUTHORS   Elliott,T. and Geiduschek,E.P.
  JOURNAL   Cell 36, 211-219 (1984)
  STANDARD  full staff_review
REFERENCE   3  (bases 1 to 1883)
  AUTHORS   Parker,M.L., Christensen,A.C., Boosman,A., Stockard,J., Young,E.T.
            and Doermann,A.H.
  TITLE     Nucleotide sequence of bacteriophage T4 gene 23 and the amino acid
            sequence of its product
  JOURNAL   J. Mol. Biol. 180, 399-416 (1984)
  STANDARD  full automatic
COMMENT     SWISS-PROT; P04535; COAT$BPT4.
            
            From EMBL    entry MYT4G23;  dated 22-APR-1990.
FEATURES             Location/Qualifiers
     CDS             <1..159
                     /note="gp22 (gene 22)"
                     /codon_start=1
     promoter        39..42
                     /note="put. promoter (gene 23)"
     RBS             169..172
                     /note="pot. ribosome binding site"
     CDS             181..1743
                     /note="gp23: major capsid protein (gene 23) (aa 1-521)"
                     /codon_start=181
     mutation        313..313
                     /note="C is T in am H11"
     mutation        427..427
                     /note="C is T in am C140"
     mutation        448..448
                     /note="C is T in am B17"
     mutation        649..649
                     /note="C is T in am B272"
     mutation        727..727
                     /note="C is T in am H32 and am E509"
     mutation        886..886
                     /note="C is T in am E506"
     mutation        920..920
                     /note="G is A in am E161"
     mutation        1106..1106
                     /note="G is A in am B278 and am E1270"
     mutation        1123..1123
                     /note="C is T in am C137"
     mutation        1183..1183
                     /note="C is T in am H36 and am E389"
     mutation        1246..1246
                     /note="C is T in am E507"
     mutation        1372..1372
                     /note="C is T in am A489"
     mutation        1459..1459
                     /note="C is T in am C208"
     mutation        1648..1648
                     /note="C is T in am E757"
     mutation        1669..1669
                     /note="C is T in am E1236"
     misc_signal     1772..1791
                     /note="pot. transcription terminator (gene 21, 22, 23)"
     repeat_unit     1772..1779
                     /note="inverted repeat A"
     repeat_unit     1784..1791
                     /note="inverted repeat A'"
BASE COUNT      527 a    394 c    429 g    533 t
ORIGIN
        1 aagaaatcta ataaagatga aagcactatt actgagagta taaatactcc tgatactgaa
       61 gcagccggac tgaatttcgt cactgaagct gtagaagata aagctgcaca gggtgcagaa
      121 gatattgtaa gtgtatatgc gaaagtcgca tctcgtttct aattttaaag gttaacacaa
      181 atgactatca aaactaaagc tgaacttttg aacaaatgga agccattact ggaaggtgaa
      241 ggtttaccgg aaattgctaa tagcaaacaa gcgattatcg ctaaaatctt tgaaaaccag
      301 gaaaaagatt tccagacagc tccggaatat aaagacgaaa aaattgctca ggcattcggt
      361 tctttcttaa cagaagctga aatcggtggt gaccacggtt acaatgctac caacatcgct
      421 gcaggtcaga cttctggcgc agtaactcag attggcccag ctgttatggg tatggtacgt
      481 cgtgctattc ctaacctgat tgctttcgat atttgtggtg ttcagccgat gaacagcccg
      541 actggccagg tattcgcact gcgcgcagta tatggtaaag acccagtggc tgccggtgct
      601 aaagaagcat tccacccaat gtatggtcca gatgcaatgt tctctggtca gggtgctgct
      661 aagaaattcc cagctctggc tgctagcaca caaaccacag taggtgatat ctatactcac
      721 ttcttccagg aaactggtac tgtatatctg caagcttctg ttcaagtaac aatcgatgct
      781 ggtgcgactg atgctgctaa attagatgct gaaattaaga aacaaatgga agctggtgca
      841 ctggtagaaa tcgctgaagg tatggctact tctatcgctg aactccagga aggtttcaat
      901 ggttctaccg ataacccatg gaatgaaatg ggcttccgta tcgataagca agttatcgaa
      961 gctaaatctc gtcagctgaa agctgcttac tctattgaat tagcacaaga cctccgcgct
     1021 gttcacggta tggatgctga tgctgaactg tctggtattc tggctacaga aattatgctg
     1081 gaaatcaacc gtgaagttgt tgattggatt aactactcag ctcaggttgg taaatctggt
     1141 atgaccctga ctccgggttc taaagctggt gtatttgact tccaggaccc aattgatatt
     1201 cgtggtgctc gctgggcggg tgaatccttt aaagctctgt tgttccagat tgacaaagaa
     1261 gcagttgaaa ttgctcgtca gaccggtcgt ggtgaaggta acttcattat cgcttcccgt
     1321 aacgtagtta acgttttggc ttcagttgat accggcattt cttatgctgc acagggtctg
     1381 gctaccggct ttagcactga tactaccaag tcagtatttg ctggtgttct gggtggtaaa
     1441 taccgcgtat atatcgacca gtatgctaaa caggattatt tcactgtagg ttataaaggt
     1501 ccgaacgaaa tggatgctgg tatttactat gctccatatg tagctctgac tccgctgcgt
     1561 ggttccgatc cgaagaactt ccaaccggta atgggattca aaactcgtta cggtatcggt
     1621 atcaacccat ttgcagaatc cgctgctcag gctccggctt ctcgcatcca gagcggtatg
     1681 ccttctattc tgaatagcct tggtaaaaac gcttacttta gacgtgtata tgttaaaggt
     1741 atctaatctc taacgataga aacacaattt tagggaacct tcgggttccc tttttctatt
     1801 ttatacgata gcaatcaggc atatcatccg catttatcca attgcgaata gttttaggac
     1861 taactttaaa atcgtccgct gcg
//