GenBank-Updates@genbank.bio.net (05/26/91)
LOCUS HUMARYLA 3637 bp ds-DNA PRI 26-MAY-1991
DEFINITION Human DNA for arylsulphatase A (EC 3.1.6.1)
ACCESSION X52150
KEYWORDS arylsulphatase; lysosomal enzyme.
SOURCE Homo sapiens DNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 3637)
AUTHORS Kreysing,J.
JOURNAL Unpublished (1990)
STANDARD full automatic
REFERENCE 2 (bases 1 to 650)
AUTHORS Kreysing,J., von,F.K. and Gieselmann,V.
TITLE Structure of the arylsulfatase A gene
JOURNAL Eur. J. Biochem. 191, 627-631 (1990)
STANDARD full automatic
COMMENT *source: cell type=leukocytes; library=EMBL-3; clone=G1/1; **map:
chromosome=22; See <J04593> for mRNA sequence of arylsulphatase A.
Data kindly reviewed (02-NOV-1990) by Hall L.
From EMBL entry HSARYLA; dated 31-DEC-1990.
FEATURES Location/Qualifiers
misc_feature 191..200
/note="GC-box 1"
misc_feature 201..210
/note="GC-box 2"
misc_feature 213..222
/note="GC-box 3"
misc_feature 240..249
/note="GC-box 4"
precursor_RNA 256..3356
/note="primary transcript"
mRNA 256..847
/note="exon 1"
CDS 630..847
/note="arylsulphatase A (AA 1-73) (847 is 2nd base in
codon)"
/codon_start=630
intron 848..996
/note="intron I"
mRNA 997..1237
/note="exon 2"
CDS 997..1237
/note="arylsulphatase A (AA 74-153) (997 is 3rd base in
codon)"
/codon_start=997
intron 1238..1351
/note="intron II"
mRNA 1352..1570
/note="exon 3"
CDS 1352..1570
/note="arylsulphatase A (AA 154-226)"
/codon_start=1352
intron 1571..1644
/note="intron III"
mRNA 1645..1814
/note="exon 4"
CDS 1645..1814
/note="arylsulphatase A (AA 227-283) (1814 is 2nd base in
codon)"
/codon_start=1645
intron 1815..2126
/note="intron IV"
mRNA 2127..2251
/note="exon 5"
CDS 2127..2251
/note="arylsulphatase A (AA 284-324) (2127 is 3rd base in
codon) (2251 is 1st base in codon)"
/codon_start=2127
intron 2252..2341
/note="intron V"
mRNA 2342..2469
/note="exon 6"
CDS 2342..2469
/note="arylsulphatase A (AA 325-367) (2342 is 2nd base in
codon)"
/codon_start=2342
intron 2470..2719
/note="intron VI"
mRNA 2720..2822
/note="exon 7"
CDS 2720..2822
/note="arylsulphatase A (AA 368-401) (2822 is 1st base in
codon)"
/codon_start=2720
intron 2823..2937
/note="intron VII"
mRNA 2938..3356
/note="exon 8"
CDS 2938..3254
/note="arylsulphatase A (AA 402-507) (2938 is 2nd base in
codon)"
/codon_start=2938
misc_feature 3351..3356
/note="polyA signal"
BASE COUNT 566 a 1290 c 1107 g 674 t
ORIGIN
1 agccgctcct cctctgagaa gctccggacc cgagaggaca ccgacactgc gcagcgccga
61 gcccgcgcgc agcccggacg cctcagccag ggccgaccgc gcagaggaag ctcccagagc
121 ccgtttcaag accgcagcca acagcctcag gcgcacacgg cggcctcgga gcgagcacgc
181 gcagcaacgc ccctcgcccc ggcccgcccc cggccccgcc ccgcaagggt cacaggtcac
241 ggggcggggc cgaggcggaa gcgcccgcag cccggtaccg gctcctcctg ggctccctct
301 agcgccttcc ccccggcccg actgcctggt cagcgccaag tgacttacgc ccccgaccct
361 gagcccggac cgctaggcga ggaggatcag atctccgctc gagaatctga aggtgccctg
421 gtcctggagg agttccgtcc cagccctgcg gtctcccggt actgctcgcc ccggccctct
481 ggagcttcag gaggcggccg tcagggtcgg ggagtatttg ggtccggggt ctcagggaag
541 ggcggcgcct gggtctgcgg tatcggaaag agcctgctgg agccaagtag ccctccctct
601 cttgggacag acccctcggt cccatgtcca tgggggcacc gcggtccctc ctcctggccc
661 tggctgctgg cctggccgtt gcccgtccgc ccaacatcgt gctgatcttt gccgacgacc
721 tcggctatgg ggacctgggc tgctatgggc accccagctc taccactccc aacctggacc
781 agctggcggc gggagggctg cggttcacag acttctacgt gcctgtgtct ctgtgcacac
841 cctctaggta aagagggggc cgcgcctctt ccccgccccg accctccatc cctttcctcc
901 caatggattg caggggggcg ggaaaaacgt ctgtctctct ctctagggaa ggccacattt
961 ctgtctgtct cagggactct gtgacttgtc ccgcagggcc gccctcctga ccggccggct
1021 cccggttcgg atgggcatgt accctggcgt cctggtgccc agctcccggg ggggcctgcc
1081 cctggaggag gtgaccgtgg ccgaagtcct ggctgcccga ggctacctca caggaatggc
1141 cggcaagtgg caccttgggg tggggcctga gggggccttc ctgccccccc atcagggctt
1201 ccatcgattt ctaggcatcc cgtactccca cgaccaggta ggaaccaccc gggccctcag
1261 ccaccctccc acctcccaaa gtcccccagc cccttgactg tcccgcagcc ccacctgcca
1321 gcccagccct cacggcagct gcccgcctca gggcccctgc cagaacctga cctgcttccc
1381 gccggccact ccttgcgacg gtggctgtga ccagggcctg gtccccatcc cactgttggc
1441 caacctgtcc gtggaggcgc agcccccctg gctgcccgga ctagaggccc gctacatggc
1501 tttcgcccat gacctcatgg ccgacgccca gcgccaggat cgccccttct tcctgtacta
1561 tgcctctcac gtaagtgatc ttggcccaac cccctggctg cccgttgacc cctacccagt
1621 gctaactcca gtctttgccc ccagcacacc cactaccctc agttcagtgg gcagagcttt
1681 gcagagcgtt caggccgcgg gccatttggg gactccctga tggagctgga tgcagctgtg
1741 gggaccctga tgacagccat aggggacctg gggctgcttg aagagacgct ggtcatcttc
1801 actgcagaca atgggtatgc cagcagggca gctgggtgct ccggccctgt cacgggccag
1861 ggcctggagg ccttgcagtt cagctgcttg ccaagaacat agtgggtgag ggggtgccag
1921 gagatgctgg ccacgttgca ggggcccaag gtgtagtcag gagacacagg tgcacagaga
1981 gctggtcttg gtaggcctgg gaggtgccgg gctcatgctg ggcacctccg ggcaagcttt
2041 gtgacttaga ggtgtggggc cactggtcac cctcggtggc tcagaggctg tggctccatg
2101 gctcatgagc gcctcctgtg tcccagacct gagaccatgc gtatgtcccg aggcggctgc
2161 tccggtctct tgcggtgtgg aaagggaacg acctacgagg gcggtgtccg agagcctgcc
2221 ttggccttct ggccaggtca tatcgctccc ggtcagtccg caggccctct ccttggaacc
2281 ctggccccac caccccaacc ttgatggcga actgagtgac tgaccagcct cctgccccca
2341 ggcgtgaccc acgagctggc cagctccctg gacctgctgc ctaccctggc agccctggct
2401 ggggccccac tgcccaatgt caccttggat ggctttgacc tcagccccct gctgctgggc
2461 acaggcaagg tagggccggt gacccctgat cccagatcct tggcccctgt cctggccttc
2521 ccctggggtg agtgtggcag tgctgagagt ctgtgcctca gtgcctcctg cactgagtgg
2581 catccaagtg gcgccacctc tcaggttcct gggtgggcaa gaagcggtgc acgtccaggg
2641 cctcccacca gggctggcag cccaggtatg tgcagtgctt gggcctgccc cgccccgtga
2701 cccctgactc tgcccccaga gccctcggca gtctctcttc ttctacccgt cctacccaga
2761 cgaggtccgt ggggtttttg ctgtgcggac tggaaagtac aaggctcact tcttcaccca
2821 gggtaacccc tccccgtgga tccctccccc cgaacctgct gacccctccc cggagcccta
2881 gatccctggc ccctcctctc gcccttgccc tgtgcacaga attggccccc tccccaggct
2941 ctgcccacag tgataccact gcagaccctg cctgccacgc ctccagctct ctgactgctc
3001 atgagccccc gctgctctat gacctgtcca aggaccctgg tgagaactac aacctgctgg
3061 ggggtgtggc cggggccacc ccagaggtgc tgcaagccct gaaacagctt cagctgctca
3121 aggcccagtt agacgcagct gtgaccttcg gccccagcca ggtggcccgg ggcgaggacc
3181 ccgccctgca gatctgctgt catcctggct gcaccccccg cccagcttgc tgccattgcc
3241 cagatcccca tgcctgaggg cccctcggct ggcctgggca tgtgatggct cctcactggg
3301 agttgtgggg gaggctcagg tgtctggagg gggtttgtgc ctgataacgt aataacacca
3361 gtggagactt gcagctgtga caattcgacc aatcctgggg taatgctgtg tgctggtgcc
3421 ggtcccctgt ggtacgaatg aggaaactga ggtgcagaga ggttcaggac ttgtacaaga
3481 tcacccagcc agaaagaggt tgggctggga tttgaaccct ggtgtcgtgg ctctggaagc
3541 tgccctggcg ctccttggtg atctgcgtgg gtctgtgcac acaggcacac gtcagccaca
3601 aggcacatgg acgagcgcac gtgcttgagt gcaggac
//