[bionet.molbio.genbank.updates] Human DNA for arylsulphatase A

GenBank-Updates@genbank.bio.net (05/26/91)

LOCUS       HUMARYLA     3637 bp ds-DNA             PRI       26-MAY-1991
DEFINITION  Human DNA for arylsulphatase A (EC 3.1.6.1)
ACCESSION   X52150
KEYWORDS    arylsulphatase; lysosomal enzyme.
SOURCE      Homo sapiens DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 3637)
  AUTHORS   Kreysing,J.
  JOURNAL   Unpublished (1990)
  STANDARD  full automatic
REFERENCE   2  (bases 1 to 650)
  AUTHORS   Kreysing,J., von,F.K. and Gieselmann,V.
  TITLE     Structure of the arylsulfatase A gene
  JOURNAL   Eur. J. Biochem. 191, 627-631 (1990)
  STANDARD  full automatic
COMMENT     *source: cell type=leukocytes; library=EMBL-3; clone=G1/1; **map:
            chromosome=22; See <J04593> for mRNA sequence of arylsulphatase A.
            
            Data kindly reviewed (02-NOV-1990) by Hall L.
            
            From EMBL    entry HSARYLA;  dated 31-DEC-1990.
FEATURES             Location/Qualifiers
     misc_feature    191..200
                     /note="GC-box 1"
     misc_feature    201..210
                     /note="GC-box 2"
     misc_feature    213..222
                     /note="GC-box 3"
     misc_feature    240..249
                     /note="GC-box 4"
     precursor_RNA   256..3356
                     /note="primary transcript"
     mRNA            256..847
                     /note="exon 1"
     CDS             630..847
                     /note="arylsulphatase A (AA 1-73) (847 is 2nd base in
                     codon)"
                     /codon_start=630
     intron          848..996
                     /note="intron I"
     mRNA            997..1237
                     /note="exon 2"
     CDS             997..1237
                     /note="arylsulphatase A (AA 74-153) (997 is 3rd base in
                     codon)"
                     /codon_start=997
     intron          1238..1351
                     /note="intron II"
     mRNA            1352..1570
                     /note="exon 3"
     CDS             1352..1570
                     /note="arylsulphatase A (AA 154-226)"
                     /codon_start=1352
     intron          1571..1644
                     /note="intron III"
     mRNA            1645..1814
                     /note="exon 4"
     CDS             1645..1814
                     /note="arylsulphatase A (AA 227-283) (1814 is 2nd base in
                     codon)"
                     /codon_start=1645
     intron          1815..2126
                     /note="intron IV"
     mRNA            2127..2251
                     /note="exon 5"
     CDS             2127..2251
                     /note="arylsulphatase A (AA 284-324) (2127 is 3rd base in
                     codon) (2251 is 1st base in codon)"
                     /codon_start=2127
     intron          2252..2341
                     /note="intron V"
     mRNA            2342..2469
                     /note="exon 6"
     CDS             2342..2469
                     /note="arylsulphatase A (AA 325-367) (2342 is 2nd base in
                     codon)"
                     /codon_start=2342
     intron          2470..2719
                     /note="intron VI"
     mRNA            2720..2822
                     /note="exon 7"
     CDS             2720..2822
                     /note="arylsulphatase A (AA 368-401) (2822 is 1st base in
                     codon)"
                     /codon_start=2720
     intron          2823..2937
                     /note="intron VII"
     mRNA            2938..3356
                     /note="exon 8"
     CDS             2938..3254
                     /note="arylsulphatase A (AA 402-507) (2938 is 2nd base in
                     codon)"
                     /codon_start=2938
     misc_feature    3351..3356
                     /note="polyA signal"
BASE COUNT      566 a   1290 c   1107 g    674 t
ORIGIN
        1 agccgctcct cctctgagaa gctccggacc cgagaggaca ccgacactgc gcagcgccga
       61 gcccgcgcgc agcccggacg cctcagccag ggccgaccgc gcagaggaag ctcccagagc
      121 ccgtttcaag accgcagcca acagcctcag gcgcacacgg cggcctcgga gcgagcacgc
      181 gcagcaacgc ccctcgcccc ggcccgcccc cggccccgcc ccgcaagggt cacaggtcac
      241 ggggcggggc cgaggcggaa gcgcccgcag cccggtaccg gctcctcctg ggctccctct
      301 agcgccttcc ccccggcccg actgcctggt cagcgccaag tgacttacgc ccccgaccct
      361 gagcccggac cgctaggcga ggaggatcag atctccgctc gagaatctga aggtgccctg
      421 gtcctggagg agttccgtcc cagccctgcg gtctcccggt actgctcgcc ccggccctct
      481 ggagcttcag gaggcggccg tcagggtcgg ggagtatttg ggtccggggt ctcagggaag
      541 ggcggcgcct gggtctgcgg tatcggaaag agcctgctgg agccaagtag ccctccctct
      601 cttgggacag acccctcggt cccatgtcca tgggggcacc gcggtccctc ctcctggccc
      661 tggctgctgg cctggccgtt gcccgtccgc ccaacatcgt gctgatcttt gccgacgacc
      721 tcggctatgg ggacctgggc tgctatgggc accccagctc taccactccc aacctggacc
      781 agctggcggc gggagggctg cggttcacag acttctacgt gcctgtgtct ctgtgcacac
      841 cctctaggta aagagggggc cgcgcctctt ccccgccccg accctccatc cctttcctcc
      901 caatggattg caggggggcg ggaaaaacgt ctgtctctct ctctagggaa ggccacattt
      961 ctgtctgtct cagggactct gtgacttgtc ccgcagggcc gccctcctga ccggccggct
     1021 cccggttcgg atgggcatgt accctggcgt cctggtgccc agctcccggg ggggcctgcc
     1081 cctggaggag gtgaccgtgg ccgaagtcct ggctgcccga ggctacctca caggaatggc
     1141 cggcaagtgg caccttgggg tggggcctga gggggccttc ctgccccccc atcagggctt
     1201 ccatcgattt ctaggcatcc cgtactccca cgaccaggta ggaaccaccc gggccctcag
     1261 ccaccctccc acctcccaaa gtcccccagc cccttgactg tcccgcagcc ccacctgcca
     1321 gcccagccct cacggcagct gcccgcctca gggcccctgc cagaacctga cctgcttccc
     1381 gccggccact ccttgcgacg gtggctgtga ccagggcctg gtccccatcc cactgttggc
     1441 caacctgtcc gtggaggcgc agcccccctg gctgcccgga ctagaggccc gctacatggc
     1501 tttcgcccat gacctcatgg ccgacgccca gcgccaggat cgccccttct tcctgtacta
     1561 tgcctctcac gtaagtgatc ttggcccaac cccctggctg cccgttgacc cctacccagt
     1621 gctaactcca gtctttgccc ccagcacacc cactaccctc agttcagtgg gcagagcttt
     1681 gcagagcgtt caggccgcgg gccatttggg gactccctga tggagctgga tgcagctgtg
     1741 gggaccctga tgacagccat aggggacctg gggctgcttg aagagacgct ggtcatcttc
     1801 actgcagaca atgggtatgc cagcagggca gctgggtgct ccggccctgt cacgggccag
     1861 ggcctggagg ccttgcagtt cagctgcttg ccaagaacat agtgggtgag ggggtgccag
     1921 gagatgctgg ccacgttgca ggggcccaag gtgtagtcag gagacacagg tgcacagaga
     1981 gctggtcttg gtaggcctgg gaggtgccgg gctcatgctg ggcacctccg ggcaagcttt
     2041 gtgacttaga ggtgtggggc cactggtcac cctcggtggc tcagaggctg tggctccatg
     2101 gctcatgagc gcctcctgtg tcccagacct gagaccatgc gtatgtcccg aggcggctgc
     2161 tccggtctct tgcggtgtgg aaagggaacg acctacgagg gcggtgtccg agagcctgcc
     2221 ttggccttct ggccaggtca tatcgctccc ggtcagtccg caggccctct ccttggaacc
     2281 ctggccccac caccccaacc ttgatggcga actgagtgac tgaccagcct cctgccccca
     2341 ggcgtgaccc acgagctggc cagctccctg gacctgctgc ctaccctggc agccctggct
     2401 ggggccccac tgcccaatgt caccttggat ggctttgacc tcagccccct gctgctgggc
     2461 acaggcaagg tagggccggt gacccctgat cccagatcct tggcccctgt cctggccttc
     2521 ccctggggtg agtgtggcag tgctgagagt ctgtgcctca gtgcctcctg cactgagtgg
     2581 catccaagtg gcgccacctc tcaggttcct gggtgggcaa gaagcggtgc acgtccaggg
     2641 cctcccacca gggctggcag cccaggtatg tgcagtgctt gggcctgccc cgccccgtga
     2701 cccctgactc tgcccccaga gccctcggca gtctctcttc ttctacccgt cctacccaga
     2761 cgaggtccgt ggggtttttg ctgtgcggac tggaaagtac aaggctcact tcttcaccca
     2821 gggtaacccc tccccgtgga tccctccccc cgaacctgct gacccctccc cggagcccta
     2881 gatccctggc ccctcctctc gcccttgccc tgtgcacaga attggccccc tccccaggct
     2941 ctgcccacag tgataccact gcagaccctg cctgccacgc ctccagctct ctgactgctc
     3001 atgagccccc gctgctctat gacctgtcca aggaccctgg tgagaactac aacctgctgg
     3061 ggggtgtggc cggggccacc ccagaggtgc tgcaagccct gaaacagctt cagctgctca
     3121 aggcccagtt agacgcagct gtgaccttcg gccccagcca ggtggcccgg ggcgaggacc
     3181 ccgccctgca gatctgctgt catcctggct gcaccccccg cccagcttgc tgccattgcc
     3241 cagatcccca tgcctgaggg cccctcggct ggcctgggca tgtgatggct cctcactggg
     3301 agttgtgggg gaggctcagg tgtctggagg gggtttgtgc ctgataacgt aataacacca
     3361 gtggagactt gcagctgtga caattcgacc aatcctgggg taatgctgtg tgctggtgcc
     3421 ggtcccctgt ggtacgaatg aggaaactga ggtgcagaga ggttcaggac ttgtacaaga
     3481 tcacccagcc agaaagaggt tgggctggga tttgaaccct ggtgtcgtgg ctctggaagc
     3541 tgccctggcg ctccttggtg atctgcgtgg gtctgtgcac acaggcacac gtcagccaca
     3601 aggcacatgg acgagcgcac gtgcttgagt gcaggac
//