GenBank-Updates@genbank.bio.net (05/25/91)
LOCUS       CREPERAS     3111 bp ss-mRNA            PLN       25-MAY-1991
DEFINITION  Chlamydomonas mRNA for periplasmic arylsulfatase
ACCESSION   X52304
KEYWORDS    AS gene; arylsulfatase; periplasmic protein.
SOURCE      Chlamydomonas reinhardtii RNA.
  ORGANISM  Chlamydomonas reinhardtii
            Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae;
            Volvocales; Chlamydomonadaceae.
REFERENCE   1  (bases 481 to 3111)
  AUTHORS   De,H.E.L., Schilling,J. and Grossman,A.R.
  TITLE     Structure and expression of the gene encoding the periplasmic
            arylsulfatase of Chlamydomonas reinhardtii
  JOURNAL   Mol. Gen. Genet. 218, 229-239 (1989)
  STANDARD  full automatic
REFERENCE   2  (bases 1 to 3111)
  AUTHORS   De,H.E.L.
  JOURNAL   Unpublished (1990)
  STANDARD  full automatic
COMMENT     *source: library=Lambda gt11 (cDNA) and EMBL4 (genomic);
            *source: strain=CW15.
            The sequence is a composite of a cDNA (for CDS region) and
            genomic. See <X16179> for intron between bases 717-718.
            See <X16180> for bases 616-3104.
            From EMBL entry CRPERAS; dated 16-AUG-1990.
FEATURES             Location/Qualifiers
     misc_feature    481..566
                     /note="Promoter region"
     misc_feature    513..516
                     /note="transcription start site"
     repeat_unit     524..527
                     /note="inverted repeat A"
     repeat_unit     563..566
                     /note="inverted repeat A'"
     CDS             616..2553
                     /note="protein precursor (AA -21 to 625)"
                     /codon_start=616
     CDS             616..678
                     /note="signal peptide (AA -21 to -1)"
                     /codon_start=616
     CDS             679..2553
                     /note="mature protein (AA 1-625)"
                     /codon_start=679
     misc_feature    3088..3095
                     /note="polyA signal"
     polyA_site      3105..3105
                     /note="polyA site"
BASE COUNT      666 a    977 c    887 g    581 t
ORIGIN
        1 cagctgaaag ccagacgtaa cagcaccacg gtggtggtga acacggtggg ctcagagaat
       61 ccggatgaag cctgcttttt tatactaagt tggcattata aaaaagcatt gcttatcaat
      121 ttgttgcaac gaacaggtca ctatcagtca aaataaaatc attatttgat ttcaattttg
      181 tcccactccc tgcctctgtc atcacgatac tgtgatgcca tggtgtccga cttatgcccg
      241 agaagatgtt gagcaaactt atcgcttatc tgcttctcat agagtcttgc agacaaactg
      301 cgcaactcgt gaaaggtagg cggatctggg aattcccgga tcctctgctg ctcattatac
      361 ctgcaggccag gcgggcgttg caagtaatca acgagccctc agatgtcaac tgccgactac
      421 tagtgctggc cacggggcgg ccaggccggg ttggcgccgg cccccgcccg cctctcccgg
      481 ccggttttat atcggcccgc gtatgtcggt ctgtcgtcgt atctcccaat ttgaccttgg
      541 ccattaaggc caacgctgtc aagggatcgt ttgggatcgg ggcaatcagc accggctcag
      601 gacgaaacca tcaagatggg tgccctcgcg gtgttcgccg tcgcttgcct cgcggcagtg
      661 gcgtcggttg cgcatgcggc cgacaccaaa aagcccaact tcgtggtgat attcaccgat
      721 gaccaggacg ccattcagaa cagcacccac ccgcactaca tgcccagcct gcacaagtac
      781 atccgctacc cgggagtgga gctgtctcag tacttcgtca ccacccccgt gtgctgcccc
      841 tcgcggacaa acctgtgcgc ggccagttcg cccacaacac caacttcacc agcgtgctgc
      901 ctccctacgg tggctgggcc aagtggaagg gcctgggcat cgaccagtcc tacctgccgc
      961 tgtggctcaa ggaccaaggc tataacacct actacgtggg caagttcctt gtggactact
     1021 ccgtcagcaa ctaccagcag gtgccgcggg ctgggacgat atcgatgccc tgtcaccccc
     1081 tacacctttg actacaacac ccgccttcag cgcaacggcg cgacccccaa catctacccc
     1141 ggcgagtaca gcactgacgt cattcgcgac aagggcgttg ctcagatcaa gtcggccgtg
     1201 gctgccggaa agccctttta cgcgcagatc tcgcccatcg cgccgcacac ctccacccag
     1261 atttccacca accccgccac cggagtgacg aggtcctact tcttcccgcc catccccgcc
     1321 cccccgcact ggcagctgtt ctccgacgcc aacctgcccg gcggcagcca acaagaacct
     1381 ttacgaggtg gacgtgagcg acaagcccgc ctggatccgc gccctgccgc tggcccagca
     1441 gaacaaccgc acctaccagg aggagatcta ccgcctgcgc ctgaggtcgc tgggcccgtg
     1501 gacgagctga ttgagcaagt cgtcaagacc ctggatgagg cgggtgtgct tgacaacacc
     1561 tacatcatct acagcgctga caatggctac cacgtgggtg cccaccgctt cggcgcgggc
     1621 aagaccacgg gctatgagga ggacctgcgt gtgcccttcc tcatccgcgg cccaggcatc
     1681 aaggccagca agtccgacaa gccgcagaac agcaaggttg gcctgcacgt ggactttgcg
     1741 cccaccattc tgagcctggc cggcgcctcg cacctgctcg gggacaaggg gctggacggc
     1801 accccgctgg gcctgtacgc caacgacgac ggcactcttc cgtccgacta ccctcgtccg
     1861 gagcagcacc gccagcagtt ccagggcgag ttctggggcg gctggagtga tgagctgctg
     1921 cagaacctca ggtcccagcc caacaacact tggaaggtgg tgcgcacgta tgacgagagc
     1981 agcaagcagg gatggaagct catcgcgcag tgcaccaacg agcgcgagct gtacgacctg
     2041 cgcaaggacc ccggtgagct gtacaacatc tacgacaagg ccaagcccgc cgtgcgcagc
     2101 cgcctggagg ggctgctggc ggtgctggcc gtgtgcaagg gggagagctg ctccaacccg
     2161 tggaagatcc tgcaccccga cggcaccgtc aagaacttca cccaggcact caactccaag
     2221 tacgaccgca tctacaacgc catccgcccc ttcacctaca agaggtgcct gccgtacctg
     2281 gattgggaca acgaggacag tcagttcaag acgcagatcc gcggcgccaa ccccgcagcc
     2341 ggcgtgggcc accaccgcct gctcaccgcc gccagcgagc gcgccatcgc cacccgccgc
     2401 cgcgcccagg ccgccgtcag tgccgagctg gcggacgggc cggctgtgtt ccaggcaaag
     2461 gtcgaggaga agtcggtgcc ggtgccccag gacatcctga aggccgacgt ggagaagtgg
     2521 ttcgccttca acaatgccga gtactacctg gcataatatt aaacacaaaa gcgatcaaat
     2581 cgaagcgcgc atggacatag cgcatcgacc aagcgccacc atcgcttggg tttctatgat
     2641 acggttgggt tacgtataat atatgggttt tggacgtggc cgcttggtca gtaagtggtc
     2701 cacgtggtac tgccgcgtgc gtgattcccc ccgaatgtat attacgtttc gatgtaatgg
     2761 gtgtttccac ggaagttaca ggcaaggtgc gtacataacc cgggatgcga tgaaggtgct
     2821 ctgccgtcgt gatcacgagt gggagtgatg gaagataccg gcatgcatct gcgtggcgga
     2881 caggcaatgt ggcaggggca tggtttggac tgcggagcca aatgtttagg ctgcttgcag
     2941 agagtgtgtt acggcgttgg ccaacacccc aaggcgcacg gtgcgcagct ggtaatccag
     3001 gtcggtgctc ttgacgagct ttgggtcgat aagagtgcag cggccttgac gcgtgctgtg
     3061 aagattcagc aatctgatta tgccctgtgt aatacagcgc acgaaaaaaa a
//