GenBank-Updates@genbank.bio.net (05/26/91)
LOCUS HUMARYLA 3637 bp ds-DNA PRI 26-MAY-1991 DEFINITION Human DNA for arylsulphatase A (EC 3.1.6.1) ACCESSION X52150 KEYWORDS arylsulphatase; lysosomal enzyme. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3637) AUTHORS Kreysing,J. JOURNAL Unpublished (1990) STANDARD full automatic REFERENCE 2 (bases 1 to 650) AUTHORS Kreysing,J., von,F.K. and Gieselmann,V. TITLE Structure of the arylsulfatase A gene JOURNAL Eur. J. Biochem. 191, 627-631 (1990) STANDARD full automatic COMMENT *source: cell type=leukocytes; library=EMBL-3; clone=G1/1; **map: chromosome=22; See <J04593> for mRNA sequence of arylsulphatase A. Data kindly reviewed (02-NOV-1990) by Hall L. From EMBL entry HSARYLA; dated 31-DEC-1990. FEATURES Location/Qualifiers misc_feature 191..200 /note="GC-box 1" misc_feature 201..210 /note="GC-box 2" misc_feature 213..222 /note="GC-box 3" misc_feature 240..249 /note="GC-box 4" precursor_RNA 256..3356 /note="primary transcript" mRNA 256..847 /note="exon 1" CDS 630..847 /note="arylsulphatase A (AA 1-73) (847 is 2nd base in codon)" /codon_start=630 intron 848..996 /note="intron I" mRNA 997..1237 /note="exon 2" CDS 997..1237 /note="arylsulphatase A (AA 74-153) (997 is 3rd base in codon)" /codon_start=997 intron 1238..1351 /note="intron II" mRNA 1352..1570 /note="exon 3" CDS 1352..1570 /note="arylsulphatase A (AA 154-226)" /codon_start=1352 intron 1571..1644 /note="intron III" mRNA 1645..1814 /note="exon 4" CDS 1645..1814 /note="arylsulphatase A (AA 227-283) (1814 is 2nd base in codon)" /codon_start=1645 intron 1815..2126 /note="intron IV" mRNA 2127..2251 /note="exon 5" CDS 2127..2251 /note="arylsulphatase A (AA 284-324) (2127 is 3rd base in codon) (2251 is 1st base in codon)" /codon_start=2127 intron 2252..2341 /note="intron V" mRNA 2342..2469 /note="exon 6" CDS 2342..2469 /note="arylsulphatase A (AA 325-367) (2342 is 2nd base in codon)" /codon_start=2342 intron 2470..2719 /note="intron VI" mRNA 2720..2822 /note="exon 7" CDS 2720..2822 /note="arylsulphatase A (AA 368-401) (2822 is 1st base in codon)" /codon_start=2720 intron 2823..2937 /note="intron VII" mRNA 2938..3356 /note="exon 8" CDS 2938..3254 /note="arylsulphatase A (AA 402-507) (2938 is 2nd base in codon)" /codon_start=2938 misc_feature 3351..3356 /note="polyA signal" BASE COUNT 566 a 1290 c 1107 g 674 t ORIGIN 1 agccgctcct cctctgagaa gctccggacc cgagaggaca ccgacactgc gcagcgccga 61 gcccgcgcgc agcccggacg cctcagccag ggccgaccgc gcagaggaag ctcccagagc 121 ccgtttcaag accgcagcca acagcctcag gcgcacacgg cggcctcgga gcgagcacgc 181 gcagcaacgc ccctcgcccc ggcccgcccc cggccccgcc ccgcaagggt cacaggtcac 241 ggggcggggc cgaggcggaa gcgcccgcag cccggtaccg gctcctcctg ggctccctct 301 agcgccttcc ccccggcccg actgcctggt cagcgccaag tgacttacgc ccccgaccct 361 gagcccggac cgctaggcga ggaggatcag atctccgctc gagaatctga aggtgccctg 421 gtcctggagg agttccgtcc cagccctgcg gtctcccggt actgctcgcc ccggccctct 481 ggagcttcag gaggcggccg tcagggtcgg ggagtatttg ggtccggggt ctcagggaag 541 ggcggcgcct gggtctgcgg tatcggaaag agcctgctgg agccaagtag ccctccctct 601 cttgggacag acccctcggt cccatgtcca tgggggcacc gcggtccctc ctcctggccc 661 tggctgctgg cctggccgtt gcccgtccgc ccaacatcgt gctgatcttt gccgacgacc 721 tcggctatgg ggacctgggc tgctatgggc accccagctc taccactccc aacctggacc 781 agctggcggc gggagggctg cggttcacag acttctacgt gcctgtgtct ctgtgcacac 841 cctctaggta aagagggggc cgcgcctctt ccccgccccg accctccatc cctttcctcc 901 caatggattg caggggggcg ggaaaaacgt ctgtctctct ctctagggaa ggccacattt 961 ctgtctgtct cagggactct gtgacttgtc ccgcagggcc gccctcctga ccggccggct 1021 cccggttcgg atgggcatgt accctggcgt cctggtgccc agctcccggg ggggcctgcc 1081 cctggaggag gtgaccgtgg ccgaagtcct ggctgcccga ggctacctca caggaatggc 1141 cggcaagtgg caccttgggg tggggcctga gggggccttc ctgccccccc atcagggctt 1201 ccatcgattt ctaggcatcc cgtactccca cgaccaggta ggaaccaccc gggccctcag 1261 ccaccctccc acctcccaaa gtcccccagc cccttgactg tcccgcagcc ccacctgcca 1321 gcccagccct cacggcagct gcccgcctca gggcccctgc cagaacctga cctgcttccc 1381 gccggccact ccttgcgacg gtggctgtga ccagggcctg gtccccatcc cactgttggc 1441 caacctgtcc gtggaggcgc agcccccctg gctgcccgga ctagaggccc gctacatggc 1501 tttcgcccat gacctcatgg ccgacgccca gcgccaggat cgccccttct tcctgtacta 1561 tgcctctcac gtaagtgatc ttggcccaac cccctggctg cccgttgacc cctacccagt 1621 gctaactcca gtctttgccc ccagcacacc cactaccctc agttcagtgg gcagagcttt 1681 gcagagcgtt caggccgcgg gccatttggg gactccctga tggagctgga tgcagctgtg 1741 gggaccctga tgacagccat aggggacctg gggctgcttg aagagacgct ggtcatcttc 1801 actgcagaca atgggtatgc cagcagggca gctgggtgct ccggccctgt cacgggccag 1861 ggcctggagg ccttgcagtt cagctgcttg ccaagaacat agtgggtgag ggggtgccag 1921 gagatgctgg ccacgttgca ggggcccaag gtgtagtcag gagacacagg tgcacagaga 1981 gctggtcttg gtaggcctgg gaggtgccgg gctcatgctg ggcacctccg ggcaagcttt 2041 gtgacttaga ggtgtggggc cactggtcac cctcggtggc tcagaggctg tggctccatg 2101 gctcatgagc gcctcctgtg tcccagacct gagaccatgc gtatgtcccg aggcggctgc 2161 tccggtctct tgcggtgtgg aaagggaacg acctacgagg gcggtgtccg agagcctgcc 2221 ttggccttct ggccaggtca tatcgctccc ggtcagtccg caggccctct ccttggaacc 2281 ctggccccac caccccaacc ttgatggcga actgagtgac tgaccagcct cctgccccca 2341 ggcgtgaccc acgagctggc cagctccctg gacctgctgc ctaccctggc agccctggct 2401 ggggccccac tgcccaatgt caccttggat ggctttgacc tcagccccct gctgctgggc 2461 acaggcaagg tagggccggt gacccctgat cccagatcct tggcccctgt cctggccttc 2521 ccctggggtg agtgtggcag tgctgagagt ctgtgcctca gtgcctcctg cactgagtgg 2581 catccaagtg gcgccacctc tcaggttcct gggtgggcaa gaagcggtgc acgtccaggg 2641 cctcccacca gggctggcag cccaggtatg tgcagtgctt gggcctgccc cgccccgtga 2701 cccctgactc tgcccccaga gccctcggca gtctctcttc ttctacccgt cctacccaga 2761 cgaggtccgt ggggtttttg ctgtgcggac tggaaagtac aaggctcact tcttcaccca 2821 gggtaacccc tccccgtgga tccctccccc cgaacctgct gacccctccc cggagcccta 2881 gatccctggc ccctcctctc gcccttgccc tgtgcacaga attggccccc tccccaggct 2941 ctgcccacag tgataccact gcagaccctg cctgccacgc ctccagctct ctgactgctc 3001 atgagccccc gctgctctat gacctgtcca aggaccctgg tgagaactac aacctgctgg 3061 ggggtgtggc cggggccacc ccagaggtgc tgcaagccct gaaacagctt cagctgctca 3121 aggcccagtt agacgcagct gtgaccttcg gccccagcca ggtggcccgg ggcgaggacc 3181 ccgccctgca gatctgctgt catcctggct gcaccccccg cccagcttgc tgccattgcc 3241 cagatcccca tgcctgaggg cccctcggct ggcctgggca tgtgatggct cctcactggg 3301 agttgtgggg gaggctcagg tgtctggagg gggtttgtgc ctgataacgt aataacacca 3361 gtggagactt gcagctgtga caattcgacc aatcctgggg taatgctgtg tgctggtgcc 3421 ggtcccctgt ggtacgaatg aggaaactga ggtgcagaga ggttcaggac ttgtacaaga 3481 tcacccagcc agaaagaggt tgggctggga tttgaaccct ggtgtcgtgg ctctggaagc 3541 tgccctggcg ctccttggtg atctgcgtgg gtctgtgcac acaggcacac gtcagccaca 3601 aggcacatgg acgagcgcac gtgcttgagt gcaggac //