[bionet.molbio.genbank.updates] Porcine gene for plasminogen activator

GenBank-Updates@genbank.bio.net (05/22/91)

LOCUS       PIGUPAG      7143 bp ds-DNA             MAM       22-MAY-1991
DEFINITION  Porcine gene for plasminogen activator
ACCESSION   X01648
KEYWORDS    plasminogen activator; urokinase.
SOURCE      Sus scrofa DNA.
  ORGANISM  Sus scrofa
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE   1  (bases 1 to 7143)
  AUTHORS   Nagamine,Y., Pearson,D., Altus,M.S. and Reich,E.
  TITLE     cDNA and gene nucleotide sequence of porcine plasminogen activator
  JOURNAL   Nucleic Acids Res. 12, 9525-9541 (1984)
  STANDARD  full automatic
COMMENT     EPD; 14053; Ss urokinase (uPA). SWISS-PROT; P04185; UROK$PIG.
            
            From EMBL 26   entry SSUPAG;  dated 06-JUL-1989.
FEATURES             Location/Qualifiers
     promoter        48..55
                     /note="pot. TATA-box"
     promoter        233..240
                     /note="pot. TATA-box"
     misc_feature    567..571
                     /note="ggggc pentanucleotide 1"
     misc_feature    581..608
                     /note="28 bp sequence homologous to rat tyrosine
                     aminotransferase"
     misc_feature    633..637
                     /note="pentanucleotide 2"
     misc_feature    644..648
                     /note="pentanucleotide 3"
     misc_feature    706..710
                     /note="pentanucleotide 4"
     misc_feature    718..722
                     /note="pentanucleotide 5"
     misc_feature    739..743
                     /note="pentanucleotide 6"
     misc_feature    845..849
                     /note="pentanucleotide 7"
     misc_feature    915..919
                     /note="pentanucleotide 8"
     misc_feature    920..924
                     /note="pentanucleotide 9"
     misc_feature    926..930
                     /note="pentanucleotide 10"
     misc_feature    932..936
                     /note="pentanucleotide 11"
     misc_feature    938..943
                     /note="pot. glucocorticoid receptor binding site"
     promoter        943..949
                     /note="TATA-box"
     repeat_region   948..957
                     /note="decanucleotide direct repeat 1"
     precursor_RNA   975..6826
                     /note="primary transcript"
     mRNA            975..1069
                     /note="exon 1"
     misc_feature    977..977
                     /note="pot. alternative transcription start"
     misc_feature    991..996
                     /note="pot. glucocorticoid receptor binding site"
     repeat_region   996..1005
                     /note="decanucleotide direct repeat 1'"
     intron          1070..1388
                     /note="intron I"
     mRNA            1389..1476
                     /note="exon 2"
     CDS             1420..1476
                     /note="pot. signal peptide (aa -20 to -2)"
                     /codon_start=1420
     intron          1477..1928
                     /note="intron II"
     mRNA            1929..1962
                     /note="exon 3"
     CDS             1929..1931
                     /note="pot. signal peptide (aa -1)"
                     /codon_start=1929
     CDS             1932..1962
                     /note="pot. pro-plasminogen activator (aa 1-10)"
                     /codon_start=1932
     intron          1963..2119
                     /note="intron III"
     mRNA            2120..2227
                     /note="exon 4"
     CDS             2120..2227
                     /note="pot. pro-plasminogen activator (aa 11-46) (2120 is
                     2nd base in codon) (2227 is 1st base in codon)"
                     /codon_start=2120
     intron          2228..2556
                     /note="intron IV"
     mRNA            2557..2731
                     /note="exon 5"
     CDS             2557..2731
                     /note="pot. pro-plasminogen activator (aa 47-105) (2557 is
                     2nd base in codon) (2731 is 2nd base in codon)"
                     /codon_start=2557
     intron          2732..2918
                     /note="intron V"
     mRNA            2919..3037
                     /note="exon 6"
     CDS             2919..3037
                     /note="pot. pro-plasminogen activator (aa 106-144) site
                     (2919 is 3rd base in codon) (3037 is 1st base in codon)"
                     /codon_start=2919
     misc_feature    2998..3000
                     /note="aa 132 (asp) pot. glycosylation site"
     intron          3038..3197
                     /note="intron VI"
     mRNA            3198..3418
                     /note="exon 7"
     CDS             3198..3418
                     /note="pot. pro-plasminogen activator (aa 145-218) (3198
                     is 2nd base in codon) (3418 is 2nd base in codon)"
                     /codon_start=3198
     misc_feature    3270..3272
                     /note="aa 169 (lys) is last aa of pot. proenzyme"
     misc_feature    3273..3275
                     /note="aa 170 (ile) is first aa of pot. mature plasminogen
                     activator (PA)"
     misc_feature    3408..3410
                     /note="aa 215 (his) pot. active site of mature PA"
     intron          3419..3643
                     /note="intron VII"
     mRNA            3644..3792
                     /note="exon 8"
     CDS             3786..3788
                     /note="aa 266 (asp) pot. active site of mature PA"
                     /codon_start=3786
     intron          3793..4436
                     /note="intron VIII"
     mRNA            4437..4577
                     /note="exon 9"
     CDS             4437..4577
                     /note="pot. pro-plasminogen activator (aa 268-314) (4437
                     is 2nd base in codon) (4577 is 1st base in codon)"
                     /codon_start=4437
     intron          4578..4903
                     /note="intron IX"
     mRNA            4904..5052
                     /note="exon 10"
     CDS             4904..5052
                     /note="pot. pro-plasminogen activator (aa 315-364) (4904
                     is 2nd base in codon)"
                     /codon_start=4904
     misc_feature    5020..5022
                     /note="aa 353 (ala) pot. glycosylation site"
     intron          5053..5707
                     /note="intron X"
     mRNA            5708..6826
     CDS             5708..5881
                     /note="pot. pro-plasminogen activator (aa 365-422)"
                     /codon_start=5708
     misc_feature    5714..5716
                     /note="aa 367 (ser) pot. active site of mature PA"
     polyA_site      6826..6826
                     /note="pot. polyadenylation site"
BASE COUNT     1733 a   1746 c   1974 g   1690 t
ORIGIN
        1 ccctcagttc cctaacccct ctcctcagag gtaaaaagaa aactttctat attaatcaaa
       61 ctttaccttt ccattaatca agactttaca ctttctagtc tttattaagt gtctatacta
      121 tggcccaggc aaactaatga ttggttcatt aaaggaactt tctgaaagac ctatcaacac
      181 ttcaaaagaa agtgtcagca gtctgtgccc taagactttg ttatcacaat cctataatat
      241 cataccagat gagccctaat tcttatccag ccctcatgtt caaagacctc ccttaaacca
      301 aactgccaat tcccaatgaa atccatgccc ccccccaccc caacaccaca agctatcttg
      361 ctgcagtgag caacgggttc agctagcgtt attcacttag ggggagctgg catacaaccg
      421 aggcagcccg ggtgagcagg ggggttttgt ccgtttcaga gagcatgagc atgtgtcagg
      481 agtattttca cattgagaaa gagacttcac agcgctgaga actatgcccg tatacccagg
      541 ggtccgaatc actctcgtag gcagctgggg ctaagggtag aaagggtgag aaagagctga
      601 ttgaggggat ctgggaggca gcatcatagc tgggggcagg ggtagggcat ctcccaaatc
      661 gatcttcttt ttgtaattcg gggtttggtg gggaggtgct gctcaggggc gggacccagg
      721 gcaggtgaat gcgaggaggg ggcggggatt ttaggtgcct ctctttccct cagttcagac
      781 caatttatcc ctcccctggg aaccgctctg ccccctcaca gttaaggttg aggaagcccg
      841 tgggggggcg gtccgagtca gagctggcct gcagggaaga ggagggaagg gagtggatgg
      901 gaagatccca ggctagggcg gggccagggc tggggcgagt cctaatatag agcctgcact
      961 gcgggcttag gagcacagcg cggagactga agtcctagag cctgccgagc atcagagtgc
     1021 ctactagtcc ccgctgtccc atacaggcca tagtcgaggg tgagtgtggg ccaccctaag
     1081 agcacagggt ggatgcaggc agcccccccc tgccggcttc acttccccct accgctggcc
     1141 cgctcggcag cgcttcgcgg ggtcaccgcg actctgtgcc cagcgcacag gagtccttct
     1201 gcgtggcgga tccgagctgt gccgcgatcc ctgagtctcc agagaggagg gacggtcagg
     1261 ttcggggaac ctggtcaccg cgggctcatc ctgcagggga ccgtgactcc tgcccccaac
     1321 tgcagtaacc cagcctgtcc gccttcgcgt ttctcccctt cttcccctga cttctccttc
     1381 ccttgcagag ccgctgtcta gagcccaagc ctcgccagca tgagagtcct gcgggcgtgc
     1441 ctgtccctct gtgtcctggt cgtgagcgac tccaaagtga gtggcttctg tgctttgact
     1501 cttggcggcg ggagggggct tgcaagaccc ctgaacaggg cccgggaaag gaaggggctg
     1561 cttagggagc tagggtcctc taaatcccat caacggcagg gccagaccct ccctgggaaa
     1621 tagggcaggt gtgacattgg ggtgttgaga accaagtgag ctctcagtgg ctggcagggg
     1681 agaaagaagc cagggactgc cctgctctgc tggcacttga ttcgtgaagc ttgcttgagt
     1741 catccatttc tctctgctgg aaacctatga tctttcattt gagagctagg cagacacgaa
     1801 cggggtgaag agagagggaa ccagagggaa gggtgagctt gggggccagt ttatcctcac
     1861 ctggaaccgc agggcatgga acctttgttg aactttccct ttctctccct ccctactcat
     1921 ctcttcaggg cagccatgaa cttcatcaag agtctggtgc atgtgagtat ccaccccttg
     1981 cacaatatct gcttgcactg atatcttgga aaagcctcag ggggcagccc tccctttacc
     2041 agcaagagga ctggctccct gattgcttcc tcccacactc cttgcttacc ccccaccccc
     2101 caacctttgt gttctgcagc gaactgtggc tgtctgaatg gaggaaaatg tgtgtcctac
     2161 aagtacttct ccaacattca gcgatgcagc tgcccaaaga aattccaagg ggagcactgt
     2221 gagataggta tgtggatcct gattctaact ggagaggagg aggcaccagg gattgtgggg
     2281 cagggagaca tgggtgggat gcaagagcag gcaggcgtta ggagttgggg gtaaaaagga
     2341 ggggggcatc tttgttccca gtgatatata gtcaaacaca aacatgcact atctcatgaa
     2401 gctgtggctg cacaaatggg aggtggggat ggaaagaaga ccctttctag tgtcttctgc
     2461 ctagcctgaa atcatgtgag gcctggaagg tcctctcaaa tgcctgtctc aacttcctcc
     2521 tctttctaat attctcatcc tcacatcctt ccatagacac atcgcaaacc tgctttgagg
     2581 ggaacggtca ctcttacaga gggaaggcca ataccaacac tggaggccgg ccctgcctgc
     2641 cctggaactc tgccactgtc cttctgaaca cgtaccatgc ccacagacct gacgccctgc
     2701 agctgggcct ggggaaacac aattactgca ggtgaggtgg gggtggcaag gaccctctgc
     2761 atcacttcac agaaaccctc attaccatcc tttttgtttt ccgagtgctg gtcagagcac
     2821 gagaatatca aggcctctgg cgagtcttcc ctggaggggg aagatgcaga aaaggcactc
     2881 tggattggaa tgacccccgt ctcccctcta ttttgcagga acccagacaa tcagagaaga
     2941 ccctggtgct acgtgcaggt tggcctgaag cagcttgtcc aagagtgcat ggtgcccaac
     3001 tgctctggtg gtgagagtca ccggcctgct tatgatggtg ggtagaaagg gacaaactca
     3061 tgtgtgttct cttagtccat cacaggaggg atgaggaggg aggcctgact ggtcctgaaa
     3121 acagggaggt cagaggacca ggagagagac acttgatgct acttcccttc cctaaagttg
     3181 cctttttctt tcctccagga aaaaatccct tctctactcc ggaaaaagta gagtttcagt
     3241 gtggccagaa ggctctgagg ccccgcttta agattgttgg gggaaaaagc accaccatcg
     3301 agaaccagcc ttggtttgca gccatctata gaaggcatcg tggaggctct gtcacctatg
     3361 tgtgtggtgg cagcctcatc agtccctgct gggtggtcag cgccacccac tgcttcatgt
     3421 atgtcttcat gttctgtctc ttctccctga ccctcctgcc ctaccccaaa taagtccctt
     3481 tctccttccc aacaaaagag ttcccttatg tctacccctc agcccctttc catatggccc
     3541 atgactttgg ggacaagtga tgctctgagg ttgctgtggt ggggagagag aagtgacagg
     3601 atctcatgag atcagaccat ctgacagatc tctcctccca cagcaattac catcaaaagg
     3661 aagactacat tgtctacctg ggtcggcaaa cccttcactc cagcactcat ggggagatga
     3721 aatttgaggt ggaaaagctc atcttgcatg aggactacag tgctgacagc cttgctcacc
     3781 acaatgatat tggtgagtag aaaccttcat ctgtaaaaag aaaaagaaaa agaaaacact
     3841 tatctgccag aatgatgatg gtggggggag aaggatccaa gaagagatcc aagtgggagg
     3901 ttggagttgt agggaacttg aagagtctac tttaccaaca gagggggtgg aggggaaggg
     3961 tccagcatga catacgtgag gggcctggtg ctcctctgta gaggccctga atttccaaac
     4021 aggtagcctt ctctggaggg caatggcccg aaggtgtgta gcttggactg gatgttcttt
     4081 ccattgttga atggagtctg ttccaggata tagaacttgg agagagtgtt gggctggatt
     4141 tcagcccagc tacctcagac agggattttc tagaaaacag aacagaacaa caacccatac
     4201 agctgtatgc agcagccctg gctgtccaag tctttgtcaa cagctggaaa aaagccctga
     4261 ggcatgggac aagggagatt tattttgggt gatgaactac caacagactt ccctaggttg
     4321 acctctaacc ctgacgtcag aatagatatc cactccctca gggtttgaag gggagagatg
     4381 gtgaccacct caccaggtgg tgatctttcc tctctgacca cttccccctc ctccagcctt
     4441 gctgaagatc cgtaccgaca agggccagtg tgcacagcca tcccgctcca tacagaccat
     4501 ctgcctgccc ccagtgaatg gcgatgccca ttttggcgca agctgtgaaa tcgtcggctt
     4561 tggaaaagaa gatccctgtg agtgactttt gggtctggct gagagggtcc tggggaagtg
     4621 ctgtaacctg gaagtgagct cagcttgatt gagggagcac catggaggca gcagatgggt
     4681 caaggatgga gtggggagca ttgtttaggg aatgatgagc cataacgtta attgggtgag
     4741 gagtgaggga gtataggcgg gtaaaaacct agacctgggt ggaaaaagaa taaggacttt
     4801 ccctgctaag ggtacctttt ggtcctctcc ctgacagaga gtcccagtgt gcaggctgac
     4861 agacacatat taatgtaaat tcctccctgt atcctctgtc tagctgacta cctctatcca
     4921 gagcagctga aaatgactgt tgtgaagcta gtttcccacc gggagtgtca gcagccccat
     4981 tactacggca gcgaagtcac caccaaaatg ctgtgtgctg ctgacccaca gtggaaaacg
     5041 gattcctgcc aggtgagagt tccaagcatc tctttccata acccctatat atctgaagga
     5101 tcctgggcct ctctctacca gctttgagga gcccctctcc agccaaagcc ccaagaagcc
     5161 aggaccaggg gctcaggtct tggaaagttt aaaccagtcc atatgtgttt accaaacatt
     5221 tgtccaagag cctggctcat gctcggcatt tcagacttag ggagggaggt ggggaaataa
     5281 agagcataag aaagagaaaa caatagaaaa taacataact cagtgctcag agtggagtgt
     5341 taagtataaa tgatccgatg tttaggagag ggagccgagt gccagctaga tggtcatggg
     5401 ggccaggggg gtgctttaga gggaatgtga ttcaatagga acttgaggag gcacagggga
     5461 aaaaagagca tttcagctga aagatacaaa catgagaagg agcatggcat attttggggt
     5521 gggaagaagt agggtctgcg aatatagatg gaatcgtggc caacaggact ttgagagcag
     5581 aggattttaa ccttgatgtg ctaggaaatg gggagccacg aagaactttg actccaggag
     5641 tgggaagatc ggatggtccc ctttgcactt ctcccagggc tcatcttttg tgtctctggc
     5701 ttaacaggga gactccgggg ggccactggt ctgctccacc caaggccgcc tgactctgac
     5761 tgggattgtg agctggggcc gtgaatgtgc catgaaggac aaacccggcg tctacacaag
     5821 ggtctcacgc ttcctgacct ggatccacac tcatgttggg ggagagaatg gcctagccca
     5881 ctgagggccc ccagagaacc aagggaagag aggggcacca cccattccca tgctgactgt
     5941 caagtttttg cagtaaggcc atctgcacag ctgtatataa ggaagagact gaggaagatg
     6001 ggctctgcag agatggtttg cttgggctgc ccaccagggt gagcgactgt cgctttactc
     6061 tcagatacaa gtctgggtgc tgggcaccca gattcccccc tggccaggat ggaagggtgg
     6121 tcctgaacca gggtggtatt atcgttgtat ggactgaagc cacctggagt gaaacatggc
     6181 atcttccgtg cataggtgag gagagcctgc tcccctgagt gggccattca cgaggcccac
     6241 tgttgggaaa tgaagaattt cccaattagg aagtgtgaca gaactgaggt ctcttgagag
     6301 agcttggcca atgcaggaac agtggtttgg ggagtagaaa cactaatgat ttgagggaag
     6361 ggctctgaca ttccatgaat gtatcaggaa atgttatatg cgtgtgtgtg tttgcacatc
     6421 ttgttcacag gctgtgtcag tgtaagagcc ggtgttctgt gcctgacagc aagtctagat
     6481 atttccccaa attgtgtaga ctgtgatgtc acatagaatg gtcggtttca agacgtcgtg
     6541 ggtcactcct agggcctctt gggtctccta tgtgatacat ctaaatgtat catcctgggg
     6601 cactgactgt gaccagcact caatttccag tatcactttc acgtagatgt atgtttcttg
     6661 gccagttacc ctttctgacc ttccagccaa gttcatccaa tcctcactga gtgaggtgag
     6721 gaccactcct gtacactgag tatttaataa ttatgttcta ctatttttat ttatatctat
     6781 ttttataatt ttgaataaag atgatcaata aaacgtgatt tttctgaaga tattggctct
     6841 tcctggtgct tgagagggcg ttgggggtat aaaaagtaga aaatgacgat gtggcgtgca
     6901 ccccagggtt tcctgtgggg ctggatctct ggacttaatg gggctttggg aagaagaggt
     6961 taagagagtt gtagagcaga ctctcccctg catctaaggt acaaaaatgt gccctgaact
     7021 agaagcccac agcttggtaa atggcgtgca ggctttgtcg actttcaatt gcctcctcag
     7081 tctgatgcct ctgtgcagta gacatcttgt ccaactgcta catggccatc ctcttcagga
     7141 tcc
//