GenBank-Updates@genbank.bio.net (05/18/91)
LOCUS ECORSPA 2412 bp ds-DNA BCT 18-MAY-1991
DEFINITION E. coli gene rpsA coding for the S1 ribosomal protein. Also
contains an open reading frame of unknown function.
ACCESSION V00352 J01681 J01682
KEYWORDS ribosomal protein; ribosomal protein S1;
unidentified reading frame.
SOURCE Escherichia coli DNA.
ORGANISM Escherichia coli
Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE 1 (bases 1 to 2412)
AUTHORS Schnier,J. and Isono,K.
TITLE The DNA sequence of the gene rpsA of Escherichia coli coding for
ribosomal protein S1
JOURNAL Nucleic Acids Res. 10, 1857-1865 (1982)
STANDARD full automatic
COMMENT SWISS-PROT; P02349; RS1$ECOLI.
From EMBL 26 entry ECRSPA; dated 22-FEB-1991.
FEATURES Location/Qualifiers
CDS 293..640
/note="unknown protein"
/codon_start=293
CDS 646..2313
/product="S1 ribosomal protein"
/gene="rpsA"
/codon_start=646
unsure 1885..1887
/note="Ile-codon not readable as DNA"
BASE COUNT 624 a 576 c 658 g 554 t
ORIGIN
1 actggcattg gcggcattac atcaccatgt tgatgttgcg tcggaagatc ggctggtacc
61 gctggcatcc atctggatgt acgtttgtgt cgaccaatgg caatcctgga agtgatcctc
121 gaaggggaag atgtcagcgg cgaaattcgt actcaggaag tggcgaatgc agcttcacaa
181 gtcgcggcat tccacgcgtt cgtgaagcat tattgcgtcg ccaacgcgcg tttgcgcaat
241 taccagggtc tgattgccga tggccgcgac atgggaacgc tggtattccc tgatgcacca
301 gtgattaatt ttccttgacg cgtcctcgga agaacgtgcg catcgccgca tgctacagtt
361 gcaggagaag ggctttagtg ttaactttga gcgccttttg gccgagatca aagaagcgac
421 gaccgcgatc gtaaccgagc cggtaccgcc actggttccg gcagccgatg ctttagtgtt
481 ggattccacc accttaagca ttgagcaagt gattgaaaaa gcgctacaat acgcgcgcag
541 aaattggctc tcgcataagc gaccgaattt gcagtacccc cgttgcaatg gaatgaccat
601 ccgcatggag ccaggtggag ttaaatataa acctgaagat taaacatgac tgaatctttt
661 gctcaactct ttgaagagtc cttaaaagaa atcgaaaccc gcccgggttc tatcgttcgt
721 ggcgttgttg ttgctatcga caaagacgta gtactggttg acgctggtct gaaatctgag
781 tccgccatcc cggctgagca gttcaaaaac gcccagggcg agctggaaat ccaggtaggt
841 gacgaagttg acgttgctct ggacgcagta gaagacggct tcggtgaaac tctgctgtcc
901 cgtgagaaag ctaaacgtca cgaagcctgg atcacgctgg aaaaagctta cgaagatgct
961 gaaactgtta ccggtgttat caacggcaaa gttaagggcg gcttcactgt tgagctgaac
1021 ggtattcgtg cgttcctgcc aggttctctg gtagacgttc gtccggtgcg tgacactctg
1081 cacctggaag gcaaagagct tgaatttaaa gtaatcaagc tggatcagaa gcgcaacaac
1141 gttgttgttt ctcgtcgtgc cgttatcgaa tccgaaaaca gcgcagacga tcagctgctg
1201 gaaaacctgc aggaaggcat ggaagttaaa ggtatcgtta agaacctcac tgactacggt
1261 gcattcgttg atctgggcgg cgttgacggc ctgctgcaca tcactgacat ggcctggaaa
1321 cgcgttaagc atccgagcga aatcgtcaac gtgggcgacg aaatcactgt taaagtgctg
1381 aagttcgacc gcgaacgtac ccgtgtatcc ctgggcctga aacagctggg cgaagatccg
1441 tgggtagcta tcgctaaacg ttatccggaa ggtaccaaac tgactggtcg cgtgaccaac
1501 ctgaccgact acggctgctt cgttgaaatc gaagaaggcg ttgaaggcct ggtacacgtt
1561 tccgaaatgg actggaccaa caaaaacatc cacccgtcca aagttgttaa cgttggcgat
1621 gtagtggaag ttatggttct ggatatcgac gaagaacgtc gtcgtatctc cctgggtctg
1681 aaacagtgca aagctaaccc gtggcagcag ttcgcggaaa cccacaacaa gggcgaccgt
1741 gttgaaggta aaatcaagtc tatcactgac ttcggtatct tcatcggctt ggacggcggc
1801 atcgacggcc tggttcacct gtctgacatc tcctggaacg ttgcaggcga agaagcagtt
1861 cgtgaataca aaaaaggcga cgaaatcgct gcagttgttc tgcaggttga cgcagaacgt
1921 gaacgtatct ccctgggcgt taaacagctc gcagaagatc cgttcaacaa ctgggttgct
1981 ctgaacaaga aaggcgttat cgtaaccggt aaagtaactg cagttgacgc taaaggcgca
2041 accgtagaac tggctgacgg cgttgaaggt tacctgcgtg cttctgaagc atcccgtgac
2101 cgcgttgaag acgctaccct ggttctgagc gttggcgacg aagttgaagc taaattcacc
2161 ggcgttgatc gtaaaaaccg cgcaatcagc ctgtctgttc gtgcgaaaga cgaagctgac
2221 gagaaagatg caatcgcaac tgttaacaaa caggaagatg caaacttctc caacaacgca
2281 atggctgaag ctttcaaagc agctaaaggc gagtaattct ctgactcttc gggattttta
2341 ttccgaagtt tgttgagttt acttgacaga ttgcaggttt cgtccctgta atcaagcact
2401 aagggcggct ac
//