GenBank-Updates@genbank.bio.net (05/27/91)
LOCUS MUSCRPG 2140 bp ds-DNA ROD 27-MAY-1991
DEFINITION Murine crp gene for C-reactive protein
ACCESSION X13588
KEYWORDS C-reactive protein; C-reactive protein gene; crp gene.
SOURCE Mus musculus DNA.
ORGANISM Mus musculus
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE 1 (bases 1 to 2140)
AUTHORS Ohnishi,S., Maeda,S., Nishiguchi,S., Arao,T. and Shimada,K.
TITLE Structure of the mouse c-reactive protein gene
JOURNAL Biochem. Biophys. Res. Commun. 156, 814-822 (1988)
STANDARD full automatic
COMMENT SWISS-PROT; P14847; CRP$MOUSE.
*source: clone=Lm mP-10;
From EMBL entry MMCRPG; dated 21-JAN-1990.
FEATURES Location/Qualifiers
misc_feature 32..39
/note="TRE-like sequence"
misc_feature 33..40
/note="TRE-like sequence"
misc_feature 62..67
/note="HNF1-like sequence"
misc_feature 84..93
/note="HSE-like sequence"
promoter 161..165
/note="CAAT-box"
misc_feature 162..167
/note="HNF1-like sequence"
promoter 200..205
/note="TATA-box"
precursor_RNA 227..2122
/note="primary transcript"
mRNA 227..374
/note="Exon 1"
misc_feature 234..243
/note="HSE-like sequence"
misc_feature 252..264
/note="HSE-like sequence"
CDS 311..374
/note="precursor (AA -20 to 1) (374 is 1st base in codon)"
/codon_start=311
CDS 311..370
/note="signal peptide (AA -20 to -1)"
/codon_start=311
CDS 371..374
/note="C-reactive protein (AA 1) (374 is 1st base in
codon)"
/codon_start=371
intron 375..587
/note="Intron I"
mRNA 588..2122
/note="Exon 2"
CDS 588..1198
/note="C-reactive protein (AA 2-205) (588 is 2nd base in
codon)"
/codon_start=588
misc_feature 2102..2107
/note="pot. polyA signal"
polyA_site 2122..2122
/note="polyA site"
BASE COUNT 560 a 452 c 438 g 690 t
ORIGIN
1 ctttctcatt tttcctgtca cacagaagct ggtgattcag gggtcacagg agtttgtaat
61 aaataaccca cattgatttc tctgttctag aatgattttt tttttgcttc cctttctccc
121 agtggtctga cgtttacccc aagaggcagt gttaggaaat catttacaaa gtggttcagc
181 ccctccatct gctatagtta taaatctgag gatgggctgg gcccgaggca ggcgttccag
241 gactccttgt ccttgatctt tcagacaaaa cactgtcctc ttagtccaga tcccagcagc
301 atccatagcc atggagaagc tactctggtg ccttctgatc atgatcagct tctctcggac
361 ttttggtcat gaaggtagga gctatcataa agatcttttc cctatgggag aatggttgga
421 acttaatatt ttgcataagg aatcaaggat caggatcagg gtagctgtgt atttatgtaa
481 cctgggagag gaccagatga cccttgatcc caaactctac ctgtaaggga ggaataagtc
541 ttcattatct gagaaactac ttactttctt ggttttctgt ttcacagaca tgtttaaaaa
601 ggcctttgta tttcccaagg agtcagatac ttcctatgtg tctctggaag cagagtcaaa
661 gaagccactg aacaccttta ctgtgtgtct ccatttctac actgctctga gcacagtgcg
721 cagcttcagt gtcttctctt atgctaccaa gaagaactct aacgacattc tcatattttg
781 gaataaggat aaacagtata cttttggagt gggtggtgct gaagtacgat tcatggtttc
841 agagattcct gaggctccaa cacacatctg tgccagctgg gagtctgcta cggggattgt
901 agagttctgg attgatggga aagccaaggt gcggaaaagt ctgcacaagg gctacactgt
961 ggggccagat gcaagcatca tcttggggca ggagcaggac tcgtatggcg gtgactttga
1021 tgcaaagcag tctttggtgg gagacatcgg agatgtgaac atgtgggatt ttgtgctatc
1081 tccagaacag atcaacacag tctatgttgg tgggacactc agccccaatg ttttgaactg
1141 gcgggcactg aactataaag cacagggtga tgtgtttatt aagccgcagc tgtggtcctg
1201 acctactgtt gtgaaccctg aagcacctcc tgggattaca ttctctccct tgtctcgggt
1261 tatgaacctt ttagccccag cagatgttgt aggtctgttc tgtgaatatg gcctttcact
1321 tctctgcttt gtggtcctca gcactagagc acggaattta aatggaaggc ttccagcata
1381 agcatcccac taggactcta ccaaagagaa tctgacttac ccatggtttt atatatatat
1441 gtaaatatcc atatatatat acatatatac atatatatat atatacatac atatatatat
1501 atatatatat atataattga aaaaatttca gacataattc ttctccctca catagatgag
1561 aaaatagatg cacagaaagg agaataattt tttattgttt ttgttttata atgtcatctt
1621 gagtgctgta tttacatact ttctatccct ccctcttcag atcctttcct atccttccaa
1681 attctctctc aaattcatga tgtcttatta ttagtcttat gcatatatac atatgcataa
1741 tacctatcat ctatcaatca atctatctac ctatctatca tctattcatc agtcatccat
1801 cttactgatt acatttagtg cttcttgtat tttgttgaag actggacact ggataatcta
1861 tcaggagggc ccctccctga agactgattg tccttttctc agcagccact gattacctct
1921 agctcttcat atagggttct gtctttgtga aatttcttct gtccatgttg catgtcaatt
1981 ggtgtcagta tgcaggtctt gtttgggcaa cctagagtga tggagcactg actacactgt
2041 gctcagaatc agttcttttc tggaataaaa tctgtacctg aacttcccca gtccatgagt
2101 caataaagtc acctttggct tgaatgaatt tgagcagttt
//