GenBank-Updates@genbank.bio.net (05/25/91)
LOCUS CREPERAS 3111 bp ss-mRNA PLN 25-MAY-1991
DEFINITION Chlamydomonas mRNA for periplasmic arylsulfatase
ACCESSION X52304
KEYWORDS AS gene; arylsulfatase; periplasmic protein.
SOURCE Chlamydomonas reinhardtii RNA.
ORGANISM Chlamydomonas reinhardtii
Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae;
Volvocales; Chlamydomonadaceae.
REFERENCE 1 (bases 481 to 3111)
AUTHORS de Hostos,E.L., Schilling,J. and Grossman,A.R.
TITLE Structure and expression of the gene encoding the periplasmic
arylsulfatase of Chlamydomonas reinhardtii
JOURNAL Mol. Gen. Genet. 218, 229-239 (1989)
STANDARD full automatic
REFERENCE 2 (bases 1 to 3111)
AUTHORS de Hostos,E.L.
JOURNAL Unpublished (1990)
STANDARD full automatic
COMMENT *source: library=Lambda gt11 (cDNA) and EMBL4 (genomic); *source:
strain=CW15. The sequence is a composite of a cDNA (for CDS region)
and genomic. See <X16179> for intron between bases 717-718. See
<X16180> for bases 616-3104.
From EMBL entry CRPERAS; dated 16-AUG-1990.
FEATURES Location/Qualifiers
misc_feature 481..566
/note="Promoter region"
misc_feature 513..516
/note="transcription start site"
repeat_unit 524..527
/note="inverted repeat A"
repeat_unit 563..566
/note="inverted repeat A'"
CDS 616..2553
/note="protein precursor (AA -21 to 625)"
/codon_start=616
CDS 616..678
/note="signal peptide (AA -21 to -1)"
/codon_start=616
CDS 679..2553
/note="mature protein (AA 1-625)"
/codon_start=679
misc_feature 3088..3095
/note="polyA signal"
polyA_site 3105..3105
/note="polyA site"
BASE COUNT 666 a 977 c 887 g 581 t
ORIGIN
1 cagctgaaag ccagacgtaa cagcaccacg gtggtggtga acacggtggg ctcagagaat
61 ccggatgaag cctgcttttt tatactaagt tggcattata aaaaagcatt gcttatcaat
121 ttgttgcaac gaacaggtca ctatcagtca aaataaaatc attatttgat ttcaattttg
181 tcccactccc tgcctctgtc atcacgatac tgtgatgcca tggtgtccga cttatgcccg
241 agaagatgtt gagcaaactt atcgcttatc tgcttctcat agagtcttgc agacaaactg
301 cgcaactcgt gaaaggtagg cggatctggg aattcccgga tcctctgctg ctcattatac
361 tgcaggccag gcgggcgttg caagtaatca acgagccctc agatgtcaac tgccgactac
421 tagtgctggc cacggggcgg ccaggccggg ttggcgccgg cccccgcccg cctctcccgg
481 ccggttttat atcggcccgc gtatgtcggt ctgtcgtcgt atctcccaat ttgaccttgg
541 ccattaaggc caacgctgtc aagggatcgt ttgggatcgg ggcaatcagc accggctcag
601 gacgaaacca tcaagatggg tgccctcgcg gtgttcgccg tcgcttgcct cgcggcagtg
661 gcgtcggttg cgcatgcggc cgacaccaaa aagcccaact tcgtggtgat attcaccgat
721 gaccaggacg ccattcagaa cagcacccac ccgcactaca tgcccagcct gcacaagtac
781 atccgctacc cgggagtgga gctgtctcag tacttcgtca ccacccccgt gtgctgcccc
841 tcgcggacaa acctgtgcgc ggccagttcg cccacaacac caacttcacc agcgtgctgc
901 ctccctacgg tggctgggcc aagtggaagg gcctgggcat cgaccagtcc tacctgccgc
961 tgtggctcaa ggaccaaggc tataacacct actacgtggg caagttcctt gtggactact
1021 ccgtcagcaa ctaccagcag gtgccgcggg ctgggacgat atcgatgccc tgtcaccccc
1081 tacacctttg actacaacac ccgccttcag cgcaacggcg cgacccccaa catctacccc
1141 ggcgagtaca gcactgacgt cattcgcgac aagggcgttg ctcagatcaa gtcggccgtg
1201 gctgccggaa agccctttta cgcgcagatc tcgcccatcg cgccgcacac ctccacccag
1261 atttccacca accccgccac cggagtgacg aggtcctact tcttcccgcc catccccgcc
1321 cccccgcact ggcagctgtt ctccgacgcc aacctgcccg gcggcagcca acaagaacct
1381 ttacgaggtg gacgtgagcg acaagcccgc ctggatccgc gccctgccgc tggcccagca
1441 gaacaaccgc acctaccagg aggagatcta ccgcctgcgc ctgaggtcgc tgggcccgtg
1501 gacgagctga ttgagcaagt cgtcaagacc ctggatgagg cgggtgtgct tgacaacacc
1561 tacatcatct acagcgctga caatggctac cacgtgggtg cccaccgctt cggcgcgggc
1621 aagaccacgg gctatgagga ggacctgcgt gtgcccttcc tcatccgcgg cccaggcatc
1681 aaggccagca agtccgacaa gccgcagaac agcaaggttg gcctgcacgt ggactttgcg
1741 cccaccattc tgagcctggc cggcgcctcg cacctgctcg gggacaaggg gctggacggc
1801 accccgctgg gcctgtacgc caacgacgac ggcactcttc cgtccgacta ccctcgtccg
1861 gagcagcacc gccagcagtt ccagggcgag ttctggggcg gctggagtga tgagctgctg
1921 cagaacctca ggtcccagcc caacaacact tggaaggtgg tgcgcacgta tgacgagagc
1981 agcaagcagg gatggaagct catcgcgcag tgcaccaacg agcgcgagct gtacgacctg
2041 cgcaaggacc ccggtgagct gtacaacatc tacgacaagg ccaagcccgc cgtgcgcagc
2101 cgcctggagg ggctgctggc ggtgctggcc gtgtgcaagg gggagagctg ctccaacccg
2161 tggaagatcc tgcaccccga cggcaccgtc aagaacttca cccaggcact caactccaag
2221 tacgaccgca tctacaacgc catccgcccc ttcacctaca agaggtgcct gccgtacctg
2281 gattgggaca acgaggacag tcagttcaag acgcagatcc gcggcgccaa ccccgcagcc
2341 ggcgtgggcc accaccgcct gctcaccgcc gccagcgagc gcgccatcgc cacccgccgc
2401 cgcgcccagg ccgccgtcag tgccgagctg gcggacgggc cggctgtgtt ccaggcaaag
2461 gtcgaggaga agtcggtgcc ggtgccccag gacatcctga aggccgacgt ggagaagtgg
2521 ttcgccttca acaatgccga gtactacctg gcataatatt aaacacaaaa gcgatcaaat
2581 cgaagcgcgc atggacatag cgcatcgacc aagcgccacc atcgcttggg tttctatgat
2641 acggttgggt tacgtataat atatgggttt tggacgtggc cgcttggtca gtaagtggtc
2701 cacgtggtac tgccgcgtgc gtgattcccc ccgaatgtat attacgtttc gatgtaatgg
2761 gtgtttccac ggaagttaca ggcaaggtgc gtacataacc cgggatgcga tgaaggtgct
2821 ctgccgtcgt gatcacgagt gggagtgatg gaagataccg gcatgcatct gcgtggcgga
2881 caggcaatgt ggcaggggca tggtttggac tgcggagcca aatgtttagg ctgcttgcag
2941 agagtgtgtt acggcgttgg ccaacacccc aaggcgcacg gtgcgcagct ggtaatccag
3001 gtcggtgctc ttgacgagct ttgggtcgat aagagtgcag cggccttgac gcgtgctgtg
3061 aagattcagc aatctgatta tgccctgtgt aatacagcgc acgaaaaaaa a
//