GenBank-Updates@genbank.bio.net (05/22/91)
LOCUS PIGUPAG 7143 bp ds-DNA MAM 22-MAY-1991 DEFINITION Porcine gene for plasminogen activator ACCESSION X01648 KEYWORDS plasminogen activator; urokinase. SOURCE Sus scrofa DNA. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. REFERENCE 1 (bases 1 to 7143) AUTHORS Nagamine,Y., Pearson,D., Altus,M.S. and Reich,E. TITLE cDNA and gene nucleotide sequence of porcine plasminogen activator JOURNAL Nucleic Acids Res. 12, 9525-9541 (1984) STANDARD full automatic COMMENT EPD; 14053; Ss urokinase (uPA). SWISS-PROT; P04185; UROK$PIG. From EMBL 26 entry SSUPAG; dated 06-JUL-1989. FEATURES Location/Qualifiers promoter 48..55 /note="pot. TATA-box" promoter 233..240 /note="pot. TATA-box" misc_feature 567..571 /note="ggggc pentanucleotide 1" misc_feature 581..608 /note="28 bp sequence homologous to rat tyrosine aminotransferase" misc_feature 633..637 /note="pentanucleotide 2" misc_feature 644..648 /note="pentanucleotide 3" misc_feature 706..710 /note="pentanucleotide 4" misc_feature 718..722 /note="pentanucleotide 5" misc_feature 739..743 /note="pentanucleotide 6" misc_feature 845..849 /note="pentanucleotide 7" misc_feature 915..919 /note="pentanucleotide 8" misc_feature 920..924 /note="pentanucleotide 9" misc_feature 926..930 /note="pentanucleotide 10" misc_feature 932..936 /note="pentanucleotide 11" misc_feature 938..943 /note="pot. glucocorticoid receptor binding site" promoter 943..949 /note="TATA-box" repeat_region 948..957 /note="decanucleotide direct repeat 1" precursor_RNA 975..6826 /note="primary transcript" mRNA 975..1069 /note="exon 1" misc_feature 977..977 /note="pot. alternative transcription start" misc_feature 991..996 /note="pot. glucocorticoid receptor binding site" repeat_region 996..1005 /note="decanucleotide direct repeat 1'" intron 1070..1388 /note="intron I" mRNA 1389..1476 /note="exon 2" CDS 1420..1476 /note="pot. signal peptide (aa -20 to -2)" /codon_start=1420 intron 1477..1928 /note="intron II" mRNA 1929..1962 /note="exon 3" CDS 1929..1931 /note="pot. signal peptide (aa -1)" /codon_start=1929 CDS 1932..1962 /note="pot. pro-plasminogen activator (aa 1-10)" /codon_start=1932 intron 1963..2119 /note="intron III" mRNA 2120..2227 /note="exon 4" CDS 2120..2227 /note="pot. pro-plasminogen activator (aa 11-46) (2120 is 2nd base in codon) (2227 is 1st base in codon)" /codon_start=2120 intron 2228..2556 /note="intron IV" mRNA 2557..2731 /note="exon 5" CDS 2557..2731 /note="pot. pro-plasminogen activator (aa 47-105) (2557 is 2nd base in codon) (2731 is 2nd base in codon)" /codon_start=2557 intron 2732..2918 /note="intron V" mRNA 2919..3037 /note="exon 6" CDS 2919..3037 /note="pot. pro-plasminogen activator (aa 106-144) site (2919 is 3rd base in codon) (3037 is 1st base in codon)" /codon_start=2919 misc_feature 2998..3000 /note="aa 132 (asp) pot. glycosylation site" intron 3038..3197 /note="intron VI" mRNA 3198..3418 /note="exon 7" CDS 3198..3418 /note="pot. pro-plasminogen activator (aa 145-218) (3198 is 2nd base in codon) (3418 is 2nd base in codon)" /codon_start=3198 misc_feature 3270..3272 /note="aa 169 (lys) is last aa of pot. proenzyme" misc_feature 3273..3275 /note="aa 170 (ile) is first aa of pot. mature plasminogen activator (PA)" misc_feature 3408..3410 /note="aa 215 (his) pot. active site of mature PA" intron 3419..3643 /note="intron VII" mRNA 3644..3792 /note="exon 8" CDS 3786..3788 /note="aa 266 (asp) pot. active site of mature PA" /codon_start=3786 intron 3793..4436 /note="intron VIII" mRNA 4437..4577 /note="exon 9" CDS 4437..4577 /note="pot. pro-plasminogen activator (aa 268-314) (4437 is 2nd base in codon) (4577 is 1st base in codon)" /codon_start=4437 intron 4578..4903 /note="intron IX" mRNA 4904..5052 /note="exon 10" CDS 4904..5052 /note="pot. pro-plasminogen activator (aa 315-364) (4904 is 2nd base in codon)" /codon_start=4904 misc_feature 5020..5022 /note="aa 353 (ala) pot. glycosylation site" intron 5053..5707 /note="intron X" mRNA 5708..6826 CDS 5708..5881 /note="pot. pro-plasminogen activator (aa 365-422)" /codon_start=5708 misc_feature 5714..5716 /note="aa 367 (ser) pot. active site of mature PA" polyA_site 6826..6826 /note="pot. polyadenylation site" BASE COUNT 1733 a 1746 c 1974 g 1690 t ORIGIN 1 ccctcagttc cctaacccct ctcctcagag gtaaaaagaa aactttctat attaatcaaa 61 ctttaccttt ccattaatca agactttaca ctttctagtc tttattaagt gtctatacta 121 tggcccaggc aaactaatga ttggttcatt aaaggaactt tctgaaagac ctatcaacac 181 ttcaaaagaa agtgtcagca gtctgtgccc taagactttg ttatcacaat cctataatat 241 cataccagat gagccctaat tcttatccag ccctcatgtt caaagacctc ccttaaacca 301 aactgccaat tcccaatgaa atccatgccc ccccccaccc caacaccaca agctatcttg 361 ctgcagtgag caacgggttc agctagcgtt attcacttag ggggagctgg catacaaccg 421 aggcagcccg ggtgagcagg ggggttttgt ccgtttcaga gagcatgagc atgtgtcagg 481 agtattttca cattgagaaa gagacttcac agcgctgaga actatgcccg tatacccagg 541 ggtccgaatc actctcgtag gcagctgggg ctaagggtag aaagggtgag aaagagctga 601 ttgaggggat ctgggaggca gcatcatagc tgggggcagg ggtagggcat ctcccaaatc 661 gatcttcttt ttgtaattcg gggtttggtg gggaggtgct gctcaggggc gggacccagg 721 gcaggtgaat gcgaggaggg ggcggggatt ttaggtgcct ctctttccct cagttcagac 781 caatttatcc ctcccctggg aaccgctctg ccccctcaca gttaaggttg aggaagcccg 841 tgggggggcg gtccgagtca gagctggcct gcagggaaga ggagggaagg gagtggatgg 901 gaagatccca ggctagggcg gggccagggc tggggcgagt cctaatatag agcctgcact 961 gcgggcttag gagcacagcg cggagactga agtcctagag cctgccgagc atcagagtgc 1021 ctactagtcc ccgctgtccc atacaggcca tagtcgaggg tgagtgtggg ccaccctaag 1081 agcacagggt ggatgcaggc agcccccccc tgccggcttc acttccccct accgctggcc 1141 cgctcggcag cgcttcgcgg ggtcaccgcg actctgtgcc cagcgcacag gagtccttct 1201 gcgtggcgga tccgagctgt gccgcgatcc ctgagtctcc agagaggagg gacggtcagg 1261 ttcggggaac ctggtcaccg cgggctcatc ctgcagggga ccgtgactcc tgcccccaac 1321 tgcagtaacc cagcctgtcc gccttcgcgt ttctcccctt cttcccctga cttctccttc 1381 ccttgcagag ccgctgtcta gagcccaagc ctcgccagca tgagagtcct gcgggcgtgc 1441 ctgtccctct gtgtcctggt cgtgagcgac tccaaagtga gtggcttctg tgctttgact 1501 cttggcggcg ggagggggct tgcaagaccc ctgaacaggg cccgggaaag gaaggggctg 1561 cttagggagc tagggtcctc taaatcccat caacggcagg gccagaccct ccctgggaaa 1621 tagggcaggt gtgacattgg ggtgttgaga accaagtgag ctctcagtgg ctggcagggg 1681 agaaagaagc cagggactgc cctgctctgc tggcacttga ttcgtgaagc ttgcttgagt 1741 catccatttc tctctgctgg aaacctatga tctttcattt gagagctagg cagacacgaa 1801 cggggtgaag agagagggaa ccagagggaa gggtgagctt gggggccagt ttatcctcac 1861 ctggaaccgc agggcatgga acctttgttg aactttccct ttctctccct ccctactcat 1921 ctcttcaggg cagccatgaa cttcatcaag agtctggtgc atgtgagtat ccaccccttg 1981 cacaatatct gcttgcactg atatcttgga aaagcctcag ggggcagccc tccctttacc 2041 agcaagagga ctggctccct gattgcttcc tcccacactc cttgcttacc ccccaccccc 2101 caacctttgt gttctgcagc gaactgtggc tgtctgaatg gaggaaaatg tgtgtcctac 2161 aagtacttct ccaacattca gcgatgcagc tgcccaaaga aattccaagg ggagcactgt 2221 gagataggta tgtggatcct gattctaact ggagaggagg aggcaccagg gattgtgggg 2281 cagggagaca tgggtgggat gcaagagcag gcaggcgtta ggagttgggg gtaaaaagga 2341 ggggggcatc tttgttccca gtgatatata gtcaaacaca aacatgcact atctcatgaa 2401 gctgtggctg cacaaatggg aggtggggat ggaaagaaga ccctttctag tgtcttctgc 2461 ctagcctgaa atcatgtgag gcctggaagg tcctctcaaa tgcctgtctc aacttcctcc 2521 tctttctaat attctcatcc tcacatcctt ccatagacac atcgcaaacc tgctttgagg 2581 ggaacggtca ctcttacaga gggaaggcca ataccaacac tggaggccgg ccctgcctgc 2641 cctggaactc tgccactgtc cttctgaaca cgtaccatgc ccacagacct gacgccctgc 2701 agctgggcct ggggaaacac aattactgca ggtgaggtgg gggtggcaag gaccctctgc 2761 atcacttcac agaaaccctc attaccatcc tttttgtttt ccgagtgctg gtcagagcac 2821 gagaatatca aggcctctgg cgagtcttcc ctggaggggg aagatgcaga aaaggcactc 2881 tggattggaa tgacccccgt ctcccctcta ttttgcagga acccagacaa tcagagaaga 2941 ccctggtgct acgtgcaggt tggcctgaag cagcttgtcc aagagtgcat ggtgcccaac 3001 tgctctggtg gtgagagtca ccggcctgct tatgatggtg ggtagaaagg gacaaactca 3061 tgtgtgttct cttagtccat cacaggaggg atgaggaggg aggcctgact ggtcctgaaa 3121 acagggaggt cagaggacca ggagagagac acttgatgct acttcccttc cctaaagttg 3181 cctttttctt tcctccagga aaaaatccct tctctactcc ggaaaaagta gagtttcagt 3241 gtggccagaa ggctctgagg ccccgcttta agattgttgg gggaaaaagc accaccatcg 3301 agaaccagcc ttggtttgca gccatctata gaaggcatcg tggaggctct gtcacctatg 3361 tgtgtggtgg cagcctcatc agtccctgct gggtggtcag cgccacccac tgcttcatgt 3421 atgtcttcat gttctgtctc ttctccctga ccctcctgcc ctaccccaaa taagtccctt 3481 tctccttccc aacaaaagag ttcccttatg tctacccctc agcccctttc catatggccc 3541 atgactttgg ggacaagtga tgctctgagg ttgctgtggt ggggagagag aagtgacagg 3601 atctcatgag atcagaccat ctgacagatc tctcctccca cagcaattac catcaaaagg 3661 aagactacat tgtctacctg ggtcggcaaa cccttcactc cagcactcat ggggagatga 3721 aatttgaggt ggaaaagctc atcttgcatg aggactacag tgctgacagc cttgctcacc 3781 acaatgatat tggtgagtag aaaccttcat ctgtaaaaag aaaaagaaaa agaaaacact 3841 tatctgccag aatgatgatg gtggggggag aaggatccaa gaagagatcc aagtgggagg 3901 ttggagttgt agggaacttg aagagtctac tttaccaaca gagggggtgg aggggaaggg 3961 tccagcatga catacgtgag gggcctggtg ctcctctgta gaggccctga atttccaaac 4021 aggtagcctt ctctggaggg caatggcccg aaggtgtgta gcttggactg gatgttcttt 4081 ccattgttga atggagtctg ttccaggata tagaacttgg agagagtgtt gggctggatt 4141 tcagcccagc tacctcagac agggattttc tagaaaacag aacagaacaa caacccatac 4201 agctgtatgc agcagccctg gctgtccaag tctttgtcaa cagctggaaa aaagccctga 4261 ggcatgggac aagggagatt tattttgggt gatgaactac caacagactt ccctaggttg 4321 acctctaacc ctgacgtcag aatagatatc cactccctca gggtttgaag gggagagatg 4381 gtgaccacct caccaggtgg tgatctttcc tctctgacca cttccccctc ctccagcctt 4441 gctgaagatc cgtaccgaca agggccagtg tgcacagcca tcccgctcca tacagaccat 4501 ctgcctgccc ccagtgaatg gcgatgccca ttttggcgca agctgtgaaa tcgtcggctt 4561 tggaaaagaa gatccctgtg agtgactttt gggtctggct gagagggtcc tggggaagtg 4621 ctgtaacctg gaagtgagct cagcttgatt gagggagcac catggaggca gcagatgggt 4681 caaggatgga gtggggagca ttgtttaggg aatgatgagc cataacgtta attgggtgag 4741 gagtgaggga gtataggcgg gtaaaaacct agacctgggt ggaaaaagaa taaggacttt 4801 ccctgctaag ggtacctttt ggtcctctcc ctgacagaga gtcccagtgt gcaggctgac 4861 agacacatat taatgtaaat tcctccctgt atcctctgtc tagctgacta cctctatcca 4921 gagcagctga aaatgactgt tgtgaagcta gtttcccacc gggagtgtca gcagccccat 4981 tactacggca gcgaagtcac caccaaaatg ctgtgtgctg ctgacccaca gtggaaaacg 5041 gattcctgcc aggtgagagt tccaagcatc tctttccata acccctatat atctgaagga 5101 tcctgggcct ctctctacca gctttgagga gcccctctcc agccaaagcc ccaagaagcc 5161 aggaccaggg gctcaggtct tggaaagttt aaaccagtcc atatgtgttt accaaacatt 5221 tgtccaagag cctggctcat gctcggcatt tcagacttag ggagggaggt ggggaaataa 5281 agagcataag aaagagaaaa caatagaaaa taacataact cagtgctcag agtggagtgt 5341 taagtataaa tgatccgatg tttaggagag ggagccgagt gccagctaga tggtcatggg 5401 ggccaggggg gtgctttaga gggaatgtga ttcaatagga acttgaggag gcacagggga 5461 aaaaagagca tttcagctga aagatacaaa catgagaagg agcatggcat attttggggt 5521 gggaagaagt agggtctgcg aatatagatg gaatcgtggc caacaggact ttgagagcag 5581 aggattttaa ccttgatgtg ctaggaaatg gggagccacg aagaactttg actccaggag 5641 tgggaagatc ggatggtccc ctttgcactt ctcccagggc tcatcttttg tgtctctggc 5701 ttaacaggga gactccgggg ggccactggt ctgctccacc caaggccgcc tgactctgac 5761 tgggattgtg agctggggcc gtgaatgtgc catgaaggac aaacccggcg tctacacaag 5821 ggtctcacgc ttcctgacct ggatccacac tcatgttggg ggagagaatg gcctagccca 5881 ctgagggccc ccagagaacc aagggaagag aggggcacca cccattccca tgctgactgt 5941 caagtttttg cagtaaggcc atctgcacag ctgtatataa ggaagagact gaggaagatg 6001 ggctctgcag agatggtttg cttgggctgc ccaccagggt gagcgactgt cgctttactc 6061 tcagatacaa gtctgggtgc tgggcaccca gattcccccc tggccaggat ggaagggtgg 6121 tcctgaacca gggtggtatt atcgttgtat ggactgaagc cacctggagt gaaacatggc 6181 atcttccgtg cataggtgag gagagcctgc tcccctgagt gggccattca cgaggcccac 6241 tgttgggaaa tgaagaattt cccaattagg aagtgtgaca gaactgaggt ctcttgagag 6301 agcttggcca atgcaggaac agtggtttgg ggagtagaaa cactaatgat ttgagggaag 6361 ggctctgaca ttccatgaat gtatcaggaa atgttatatg cgtgtgtgtg tttgcacatc 6421 ttgttcacag gctgtgtcag tgtaagagcc ggtgttctgt gcctgacagc aagtctagat 6481 atttccccaa attgtgtaga ctgtgatgtc acatagaatg gtcggtttca agacgtcgtg 6541 ggtcactcct agggcctctt gggtctccta tgtgatacat ctaaatgtat catcctgggg 6601 cactgactgt gaccagcact caatttccag tatcactttc acgtagatgt atgtttcttg 6661 gccagttacc ctttctgacc ttccagccaa gttcatccaa tcctcactga gtgaggtgag 6721 gaccactcct gtacactgag tatttaataa ttatgttcta ctatttttat ttatatctat 6781 ttttataatt ttgaataaag atgatcaata aaacgtgatt tttctgaaga tattggctct 6841 tcctggtgct tgagagggcg ttgggggtat aaaaagtaga aaatgacgat gtggcgtgca 6901 ccccagggtt tcctgtgggg ctggatctct ggacttaatg gggctttggg aagaagaggt 6961 taagagagtt gtagagcaga ctctcccctg catctaaggt acaaaaatgt gccctgaact 7021 agaagcccac agcttggtaa atggcgtgca ggctttgtcg actttcaatt gcctcctcag 7081 tctgatgcct ctgtgcagta gacatcttgt ccaactgcta catggccatc ctcttcagga 7141 tcc //