GenBank-Updates@genbank.bio.net (05/22/91)
LOCUS PIGUPAG 7143 bp ds-DNA MAM 22-MAY-1991
DEFINITION Porcine gene for plasminogen activator
ACCESSION X01648
KEYWORDS plasminogen activator; urokinase.
SOURCE Sus scrofa DNA.
ORGANISM Sus scrofa
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE 1 (bases 1 to 7143)
AUTHORS Nagamine,Y., Pearson,D., Altus,M.S. and Reich,E.
TITLE cDNA and gene nucleotide sequence of porcine plasminogen activator
JOURNAL Nucleic Acids Res. 12, 9525-9541 (1984)
STANDARD full automatic
COMMENT EPD; 14053; Ss urokinase (uPA). SWISS-PROT; P04185; UROK$PIG.
From EMBL 26 entry SSUPAG; dated 06-JUL-1989.
FEATURES Location/Qualifiers
promoter 48..55
/note="pot. TATA-box"
promoter 233..240
/note="pot. TATA-box"
misc_feature 567..571
/note="ggggc pentanucleotide 1"
misc_feature 581..608
/note="28 bp sequence homologous to rat tyrosine
aminotransferase"
misc_feature 633..637
/note="pentanucleotide 2"
misc_feature 644..648
/note="pentanucleotide 3"
misc_feature 706..710
/note="pentanucleotide 4"
misc_feature 718..722
/note="pentanucleotide 5"
misc_feature 739..743
/note="pentanucleotide 6"
misc_feature 845..849
/note="pentanucleotide 7"
misc_feature 915..919
/note="pentanucleotide 8"
misc_feature 920..924
/note="pentanucleotide 9"
misc_feature 926..930
/note="pentanucleotide 10"
misc_feature 932..936
/note="pentanucleotide 11"
misc_feature 938..943
/note="pot. glucocorticoid receptor binding site"
promoter 943..949
/note="TATA-box"
repeat_region 948..957
/note="decanucleotide direct repeat 1"
precursor_RNA 975..6826
/note="primary transcript"
mRNA 975..1069
/note="exon 1"
misc_feature 977..977
/note="pot. alternative transcription start"
misc_feature 991..996
/note="pot. glucocorticoid receptor binding site"
repeat_region 996..1005
/note="decanucleotide direct repeat 1'"
intron 1070..1388
/note="intron I"
mRNA 1389..1476
/note="exon 2"
CDS 1420..1476
/note="pot. signal peptide (aa -20 to -2)"
/codon_start=1420
intron 1477..1928
/note="intron II"
mRNA 1929..1962
/note="exon 3"
CDS 1929..1931
/note="pot. signal peptide (aa -1)"
/codon_start=1929
CDS 1932..1962
/note="pot. pro-plasminogen activator (aa 1-10)"
/codon_start=1932
intron 1963..2119
/note="intron III"
mRNA 2120..2227
/note="exon 4"
CDS 2120..2227
/note="pot. pro-plasminogen activator (aa 11-46) (2120 is
2nd base in codon) (2227 is 1st base in codon)"
/codon_start=2120
intron 2228..2556
/note="intron IV"
mRNA 2557..2731
/note="exon 5"
CDS 2557..2731
/note="pot. pro-plasminogen activator (aa 47-105) (2557 is
2nd base in codon) (2731 is 2nd base in codon)"
/codon_start=2557
intron 2732..2918
/note="intron V"
mRNA 2919..3037
/note="exon 6"
CDS 2919..3037
/note="pot. pro-plasminogen activator (aa 106-144) site
(2919 is 3rd base in codon) (3037 is 1st base in codon)"
/codon_start=2919
misc_feature 2998..3000
/note="aa 132 (asp) pot. glycosylation site"
intron 3038..3197
/note="intron VI"
mRNA 3198..3418
/note="exon 7"
CDS 3198..3418
/note="pot. pro-plasminogen activator (aa 145-218) (3198
is 2nd base in codon) (3418 is 2nd base in codon)"
/codon_start=3198
misc_feature 3270..3272
/note="aa 169 (lys) is last aa of pot. proenzyme"
misc_feature 3273..3275
/note="aa 170 (ile) is first aa of pot. mature plasminogen
activator (PA)"
misc_feature 3408..3410
/note="aa 215 (his) pot. active site of mature PA"
intron 3419..3643
/note="intron VII"
mRNA 3644..3792
/note="exon 8"
CDS 3786..3788
/note="aa 266 (asp) pot. active site of mature PA"
/codon_start=3786
intron 3793..4436
/note="intron VIII"
mRNA 4437..4577
/note="exon 9"
CDS 4437..4577
/note="pot. pro-plasminogen activator (aa 268-314) (4437
is 2nd base in codon) (4577 is 1st base in codon)"
/codon_start=4437
intron 4578..4903
/note="intron IX"
mRNA 4904..5052
/note="exon 10"
CDS 4904..5052
/note="pot. pro-plasminogen activator (aa 315-364) (4904
is 2nd base in codon)"
/codon_start=4904
misc_feature 5020..5022
/note="aa 353 (ala) pot. glycosylation site"
intron 5053..5707
/note="intron X"
mRNA 5708..6826
CDS 5708..5881
/note="pot. pro-plasminogen activator (aa 365-422)"
/codon_start=5708
misc_feature 5714..5716
/note="aa 367 (ser) pot. active site of mature PA"
polyA_site 6826..6826
/note="pot. polyadenylation site"
BASE COUNT 1733 a 1746 c 1974 g 1690 t
ORIGIN
1 ccctcagttc cctaacccct ctcctcagag gtaaaaagaa aactttctat attaatcaaa
61 ctttaccttt ccattaatca agactttaca ctttctagtc tttattaagt gtctatacta
121 tggcccaggc aaactaatga ttggttcatt aaaggaactt tctgaaagac ctatcaacac
181 ttcaaaagaa agtgtcagca gtctgtgccc taagactttg ttatcacaat cctataatat
241 cataccagat gagccctaat tcttatccag ccctcatgtt caaagacctc ccttaaacca
301 aactgccaat tcccaatgaa atccatgccc ccccccaccc caacaccaca agctatcttg
361 ctgcagtgag caacgggttc agctagcgtt attcacttag ggggagctgg catacaaccg
421 aggcagcccg ggtgagcagg ggggttttgt ccgtttcaga gagcatgagc atgtgtcagg
481 agtattttca cattgagaaa gagacttcac agcgctgaga actatgcccg tatacccagg
541 ggtccgaatc actctcgtag gcagctgggg ctaagggtag aaagggtgag aaagagctga
601 ttgaggggat ctgggaggca gcatcatagc tgggggcagg ggtagggcat ctcccaaatc
661 gatcttcttt ttgtaattcg gggtttggtg gggaggtgct gctcaggggc gggacccagg
721 gcaggtgaat gcgaggaggg ggcggggatt ttaggtgcct ctctttccct cagttcagac
781 caatttatcc ctcccctggg aaccgctctg ccccctcaca gttaaggttg aggaagcccg
841 tgggggggcg gtccgagtca gagctggcct gcagggaaga ggagggaagg gagtggatgg
901 gaagatccca ggctagggcg gggccagggc tggggcgagt cctaatatag agcctgcact
961 gcgggcttag gagcacagcg cggagactga agtcctagag cctgccgagc atcagagtgc
1021 ctactagtcc ccgctgtccc atacaggcca tagtcgaggg tgagtgtggg ccaccctaag
1081 agcacagggt ggatgcaggc agcccccccc tgccggcttc acttccccct accgctggcc
1141 cgctcggcag cgcttcgcgg ggtcaccgcg actctgtgcc cagcgcacag gagtccttct
1201 gcgtggcgga tccgagctgt gccgcgatcc ctgagtctcc agagaggagg gacggtcagg
1261 ttcggggaac ctggtcaccg cgggctcatc ctgcagggga ccgtgactcc tgcccccaac
1321 tgcagtaacc cagcctgtcc gccttcgcgt ttctcccctt cttcccctga cttctccttc
1381 ccttgcagag ccgctgtcta gagcccaagc ctcgccagca tgagagtcct gcgggcgtgc
1441 ctgtccctct gtgtcctggt cgtgagcgac tccaaagtga gtggcttctg tgctttgact
1501 cttggcggcg ggagggggct tgcaagaccc ctgaacaggg cccgggaaag gaaggggctg
1561 cttagggagc tagggtcctc taaatcccat caacggcagg gccagaccct ccctgggaaa
1621 tagggcaggt gtgacattgg ggtgttgaga accaagtgag ctctcagtgg ctggcagggg
1681 agaaagaagc cagggactgc cctgctctgc tggcacttga ttcgtgaagc ttgcttgagt
1741 catccatttc tctctgctgg aaacctatga tctttcattt gagagctagg cagacacgaa
1801 cggggtgaag agagagggaa ccagagggaa gggtgagctt gggggccagt ttatcctcac
1861 ctggaaccgc agggcatgga acctttgttg aactttccct ttctctccct ccctactcat
1921 ctcttcaggg cagccatgaa cttcatcaag agtctggtgc atgtgagtat ccaccccttg
1981 cacaatatct gcttgcactg atatcttgga aaagcctcag ggggcagccc tccctttacc
2041 agcaagagga ctggctccct gattgcttcc tcccacactc cttgcttacc ccccaccccc
2101 caacctttgt gttctgcagc gaactgtggc tgtctgaatg gaggaaaatg tgtgtcctac
2161 aagtacttct ccaacattca gcgatgcagc tgcccaaaga aattccaagg ggagcactgt
2221 gagataggta tgtggatcct gattctaact ggagaggagg aggcaccagg gattgtgggg
2281 cagggagaca tgggtgggat gcaagagcag gcaggcgtta ggagttgggg gtaaaaagga
2341 ggggggcatc tttgttccca gtgatatata gtcaaacaca aacatgcact atctcatgaa
2401 gctgtggctg cacaaatggg aggtggggat ggaaagaaga ccctttctag tgtcttctgc
2461 ctagcctgaa atcatgtgag gcctggaagg tcctctcaaa tgcctgtctc aacttcctcc
2521 tctttctaat attctcatcc tcacatcctt ccatagacac atcgcaaacc tgctttgagg
2581 ggaacggtca ctcttacaga gggaaggcca ataccaacac tggaggccgg ccctgcctgc
2641 cctggaactc tgccactgtc cttctgaaca cgtaccatgc ccacagacct gacgccctgc
2701 agctgggcct ggggaaacac aattactgca ggtgaggtgg gggtggcaag gaccctctgc
2761 atcacttcac agaaaccctc attaccatcc tttttgtttt ccgagtgctg gtcagagcac
2821 gagaatatca aggcctctgg cgagtcttcc ctggaggggg aagatgcaga aaaggcactc
2881 tggattggaa tgacccccgt ctcccctcta ttttgcagga acccagacaa tcagagaaga
2941 ccctggtgct acgtgcaggt tggcctgaag cagcttgtcc aagagtgcat ggtgcccaac
3001 tgctctggtg gtgagagtca ccggcctgct tatgatggtg ggtagaaagg gacaaactca
3061 tgtgtgttct cttagtccat cacaggaggg atgaggaggg aggcctgact ggtcctgaaa
3121 acagggaggt cagaggacca ggagagagac acttgatgct acttcccttc cctaaagttg
3181 cctttttctt tcctccagga aaaaatccct tctctactcc ggaaaaagta gagtttcagt
3241 gtggccagaa ggctctgagg ccccgcttta agattgttgg gggaaaaagc accaccatcg
3301 agaaccagcc ttggtttgca gccatctata gaaggcatcg tggaggctct gtcacctatg
3361 tgtgtggtgg cagcctcatc agtccctgct gggtggtcag cgccacccac tgcttcatgt
3421 atgtcttcat gttctgtctc ttctccctga ccctcctgcc ctaccccaaa taagtccctt
3481 tctccttccc aacaaaagag ttcccttatg tctacccctc agcccctttc catatggccc
3541 atgactttgg ggacaagtga tgctctgagg ttgctgtggt ggggagagag aagtgacagg
3601 atctcatgag atcagaccat ctgacagatc tctcctccca cagcaattac catcaaaagg
3661 aagactacat tgtctacctg ggtcggcaaa cccttcactc cagcactcat ggggagatga
3721 aatttgaggt ggaaaagctc atcttgcatg aggactacag tgctgacagc cttgctcacc
3781 acaatgatat tggtgagtag aaaccttcat ctgtaaaaag aaaaagaaaa agaaaacact
3841 tatctgccag aatgatgatg gtggggggag aaggatccaa gaagagatcc aagtgggagg
3901 ttggagttgt agggaacttg aagagtctac tttaccaaca gagggggtgg aggggaaggg
3961 tccagcatga catacgtgag gggcctggtg ctcctctgta gaggccctga atttccaaac
4021 aggtagcctt ctctggaggg caatggcccg aaggtgtgta gcttggactg gatgttcttt
4081 ccattgttga atggagtctg ttccaggata tagaacttgg agagagtgtt gggctggatt
4141 tcagcccagc tacctcagac agggattttc tagaaaacag aacagaacaa caacccatac
4201 agctgtatgc agcagccctg gctgtccaag tctttgtcaa cagctggaaa aaagccctga
4261 ggcatgggac aagggagatt tattttgggt gatgaactac caacagactt ccctaggttg
4321 acctctaacc ctgacgtcag aatagatatc cactccctca gggtttgaag gggagagatg
4381 gtgaccacct caccaggtgg tgatctttcc tctctgacca cttccccctc ctccagcctt
4441 gctgaagatc cgtaccgaca agggccagtg tgcacagcca tcccgctcca tacagaccat
4501 ctgcctgccc ccagtgaatg gcgatgccca ttttggcgca agctgtgaaa tcgtcggctt
4561 tggaaaagaa gatccctgtg agtgactttt gggtctggct gagagggtcc tggggaagtg
4621 ctgtaacctg gaagtgagct cagcttgatt gagggagcac catggaggca gcagatgggt
4681 caaggatgga gtggggagca ttgtttaggg aatgatgagc cataacgtta attgggtgag
4741 gagtgaggga gtataggcgg gtaaaaacct agacctgggt ggaaaaagaa taaggacttt
4801 ccctgctaag ggtacctttt ggtcctctcc ctgacagaga gtcccagtgt gcaggctgac
4861 agacacatat taatgtaaat tcctccctgt atcctctgtc tagctgacta cctctatcca
4921 gagcagctga aaatgactgt tgtgaagcta gtttcccacc gggagtgtca gcagccccat
4981 tactacggca gcgaagtcac caccaaaatg ctgtgtgctg ctgacccaca gtggaaaacg
5041 gattcctgcc aggtgagagt tccaagcatc tctttccata acccctatat atctgaagga
5101 tcctgggcct ctctctacca gctttgagga gcccctctcc agccaaagcc ccaagaagcc
5161 aggaccaggg gctcaggtct tggaaagttt aaaccagtcc atatgtgttt accaaacatt
5221 tgtccaagag cctggctcat gctcggcatt tcagacttag ggagggaggt ggggaaataa
5281 agagcataag aaagagaaaa caatagaaaa taacataact cagtgctcag agtggagtgt
5341 taagtataaa tgatccgatg tttaggagag ggagccgagt gccagctaga tggtcatggg
5401 ggccaggggg gtgctttaga gggaatgtga ttcaatagga acttgaggag gcacagggga
5461 aaaaagagca tttcagctga aagatacaaa catgagaagg agcatggcat attttggggt
5521 gggaagaagt agggtctgcg aatatagatg gaatcgtggc caacaggact ttgagagcag
5581 aggattttaa ccttgatgtg ctaggaaatg gggagccacg aagaactttg actccaggag
5641 tgggaagatc ggatggtccc ctttgcactt ctcccagggc tcatcttttg tgtctctggc
5701 ttaacaggga gactccgggg ggccactggt ctgctccacc caaggccgcc tgactctgac
5761 tgggattgtg agctggggcc gtgaatgtgc catgaaggac aaacccggcg tctacacaag
5821 ggtctcacgc ttcctgacct ggatccacac tcatgttggg ggagagaatg gcctagccca
5881 ctgagggccc ccagagaacc aagggaagag aggggcacca cccattccca tgctgactgt
5941 caagtttttg cagtaaggcc atctgcacag ctgtatataa ggaagagact gaggaagatg
6001 ggctctgcag agatggtttg cttgggctgc ccaccagggt gagcgactgt cgctttactc
6061 tcagatacaa gtctgggtgc tgggcaccca gattcccccc tggccaggat ggaagggtgg
6121 tcctgaacca gggtggtatt atcgttgtat ggactgaagc cacctggagt gaaacatggc
6181 atcttccgtg cataggtgag gagagcctgc tcccctgagt gggccattca cgaggcccac
6241 tgttgggaaa tgaagaattt cccaattagg aagtgtgaca gaactgaggt ctcttgagag
6301 agcttggcca atgcaggaac agtggtttgg ggagtagaaa cactaatgat ttgagggaag
6361 ggctctgaca ttccatgaat gtatcaggaa atgttatatg cgtgtgtgtg tttgcacatc
6421 ttgttcacag gctgtgtcag tgtaagagcc ggtgttctgt gcctgacagc aagtctagat
6481 atttccccaa attgtgtaga ctgtgatgtc acatagaatg gtcggtttca agacgtcgtg
6541 ggtcactcct agggcctctt gggtctccta tgtgatacat ctaaatgtat catcctgggg
6601 cactgactgt gaccagcact caatttccag tatcactttc acgtagatgt atgtttcttg
6661 gccagttacc ctttctgacc ttccagccaa gttcatccaa tcctcactga gtgaggtgag
6721 gaccactcct gtacactgag tatttaataa ttatgttcta ctatttttat ttatatctat
6781 ttttataatt ttgaataaag atgatcaata aaacgtgatt tttctgaaga tattggctct
6841 tcctggtgct tgagagggcg ttgggggtat aaaaagtaga aaatgacgat gtggcgtgca
6901 ccccagggtt tcctgtgggg ctggatctct ggacttaatg gggctttggg aagaagaggt
6961 taagagagtt gtagagcaga ctctcccctg catctaaggt acaaaaatgt gccctgaact
7021 agaagcccac agcttggtaa atggcgtgca ggctttgtcg actttcaatt gcctcctcag
7081 tctgatgcct ctgtgcagta gacatcttgt ccaactgcta catggccatc ctcttcagga
7141 tcc
//