GenBank-Updates@genbank.bio.net (03/14/91)
LOCUS HUMPEM 4243 bp ds-DNA PRI 14-MAR-1991 DEFINITION Human polymorphic epithelial mucin (PEM) gene, complete cds. ACCESSION M61170 KEYWORDS polymorphic epithelial mucin. SOURCE Human blood lymphocyte cell and non-transformed helper T cell DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 4243) AUTHORS Lancaster,C.A., Peat,N., Duhig,T., Wilson,D., Taylor-Papadimitriou,J. and Gendler,S.J. TITLE Structure and expression of the human polymorphic epithelial mucin gene: An expressed VNTR unit JOURNAL Biochem. Biophys. Res. Commun. 173, 1019-1029 (1990) STANDARD simple staff_entry FEATURES Location/Qualifiers mRNA join(804..933,1436..2265,2365..2420,2570..2706,2851..2972, 3053..3202,3735..4112) /gene="PEM" CDS join(876..933,1436..2265,2365..2420,2570..2706,2851..2972, 3053..3202,3735..3809) /product="polymorphic epithelial mucin" /gene="PEM" /codon_start=876 exon 804..933 /number=1 /gene="PEM" exon 1436..2265 /number=2 /gene="PEM" exon 2365..2420 /number=3 /gene="PEM" exon 2570..2706 /number=4 /gene="PEM" exon 2851..2972 /number=5 /gene="PEM" exon 3053..3202 /number=6 /gene="PEM" exon 3735..4112 /number=7 /gene="PEM" BASE COUNT 853 a 1233 c 1268 g 889 t ORIGIN 1 tactcctctc cgcccggtcc gagcggcccc tcagcttgcg cggcccagcc ccgcaaggct 61 cccggtgacc actagagggc gggaggagct cctggccagt ggtggagagt ggcaaggaag 121 gaccctaggg ttcatcggag cccaggttta ctcccttaag tggaaatttc ttcccccact 181 cctccttggc tttctccaag gagggaaccc aggctgctgg aaagtccggc tggggggggg 241 actgtgggtt caggggagaa cggggtgtgg aacgggacag ggagcggtta gaagggtggg 301 gctattccgg gaagtggtgg ggggagggag cccaaaacta gcacctagtc cactcattat 361 ccagccctct tatttctcgg ccgctctgct tcagtggacc cggggagggc ggggaagtgg 421 agtgggagac ctaggggtgg gcttcccgac cttgctgtac aggacctcga cctagctggc 481 tttgttcccc atccccacgt tagttgttgc cctgaggcta aaactagagc ccaggggccc 541 caagttccag actgcccctc ccccctcccc cggagccagg gagtggttgg tgaaaggggg 601 aggccagctg gagaacaaac gggtagtcag ggggttgagc gattagagcc cttgtaccct 661 acccaggaat ggttggggag gaggaggaag aggtaggagg taggggaggg ggcggggttt 721 tgtcacctgt cacctgctcg ctgtgcctag ggcgggcggg cggggagtgg ggggaccggt 781 ataaagcggt aggcgcctgt gcccgctcca cctctcaagc agccagcgcc tgcctgaatc 841 tgttctgccc cctccccacc catttcacca ccaccatgac accgggcacc cagtctcctt 901 tcttcctgct gctgctcctc acagtgctta caggtgaggg gcacgaggtg gggagtgggc 961 tgccctgctt aggtggtctt cgtggtcttt ctgtgggttt tgctccctgg cagatggcac 1021 catgaagtta aggtaagaat tgcagacaga ggctgccctg tctgtgccag aaggagggag 1081 aggctaagga caggctgaga agagttgccc ccaaccctga gagtgggtac caggggcaag 1141 caaatgtcct gtagagaagt ctagggggaa gagagtaggg agagggaagg cttaagaggg 1201 gaagaaatgc aggggccatg agccaaggcc tatgggcaga gagaaggagg ctgctgcagg 1261 gaaggaggct tccaacccag gggttactga ggctgcccac tccccagtcc tcctggtatt 1321 atttctctgg tggccagagc ttatattttc ttcttgctct tatttttcct tcataaagac 1381 ccaaccctat gactttaact tcttacagct accacagccc ctaaacccgc aacagttgtt 1441 acaggttctg gtcatgcaag ctctacccca ggtggagaaa aggagacttc ggctacccag 1501 agaagttcag tgcccagctc tactgagaag aatgctgtga gtatgaccag cagcgtactc 1561 tccagccaca gccccggttc aggctcctcc accactcagg gacaggatgt cactctggcc 1621 ccggccacgg aaccagcttc aggttcagct gccacctggg gacaggatgt cacctcggtc 1681 ccagtcacca ggccagccct gggctccacc accccgccag cccacgatgt cacctcagcc 1741 ccggacaaca agccagcccc gggctccacc gcccccccag cccacggtgt cacctcggcc 1801 ccggacacca ggccggcccc gggctccacc gcccccccag cccatggtgt cacctcggcc 1861 ccggacaaca ggcccgcctt gggctccacc gcccctccag tccacaatgt cacctcggcc 1921 tcaggctctg catcaggctc agcttctact ctggtgcaca acggcacctc tgccagggct 1981 accacaaccc cagccagcaa gagcactcca ttctcaattc ccagccacca ctctgatact 2041 cctaccaccc ttgccagcca tagcaccaag actgatgcca gtagcactca ccatagcacg 2101 gtacctcctc tcacctcctc caatcacagc acttctcccc agttgtctac tggggtctct 2161 ttctttttcc tgtcttttca catttcaaac ctccagttta attcctctct ggaagatccc 2221 agcaccgact actaccaaga gctgcagaga gacatttctg aaatggtgag tatcggcctt 2281 tccttcccca tgctcccctg aagcagccat cagaactgtc cacacccttt gcatcaagcc 2341 cgagtccttt ccctctcacc ccagtttttg cagatttata aacaaggggg ttttctgggc 2401 ctctccaata ttaagttcag gtacagttct gggtgtggac ccagtgtggt ggttggaggg 2461 ttgggtggtg gtcatgaccg taggagggac tggtgcactt aaggttgggg gaagagtgct 2521 gagccagagc tgggacccgt ggctgaagtg cccatttccc tgtgaccagg ccaggatctg 2581 tggtggtaca attgactctg gccttccgag aaggtaccat caatgtccac gacgtggaga 2641 cacagttcaa tcagtataaa acggaagcag cctctcgata taacctgacg atctcagacg 2701 tcagcggtga ggctacttcc ctggctgcag ccagcaccat gccggggccc ctctccttcc 2761 agtgtctggg tccccgctct ttccttagtg ctggcagcgg gaggggcgcc tcctctggga 2821 gactgccctg accactgctt ttccttttag tgagtgatgt gccatttcct ttctctgccc 2881 agtctggggc tggggtgcca ggctggggca tcgcgctgct ggtgctggtc tgtgttctgg 2941 ttgcgctggc cattgtctat ctcattgcct tggtgagtgc agtccctggc cctgatcaga 3001 gccccccggt agaaggcact ccatggcctg ccataacctc ctatctcccc aggctgtctg 3061 tcagtgccgc cgaaagaact acgggcagct ggacatcttt ccagcccggg atacctacca 3121 tcctatgagc gagtacccca cctaccacac ccatgggcgc tatgtgcccc ctagcagtac 3181 cgatcgtagc ccctatgaga aggtgagatt ggccccacag gccaggggaa gcagagggtt 3241 tggctgggca aggattctga agggggtact tggaaaaccc aaagagcttg gaagaggtga 3301 gaagtggcgt gaagtgagca ggggagggcc tggcaaggat gaggggcaga ggtcagagga 3361 gttttggggg acaggcctgg gaggagacta tggaagaaag gggcctcaag agggagtggc 3421 cccactgcca gaattcctaa aaagatcatt ggccgtccac attcatgctg gctggcgctg 3481 gctgaactgg tgccaccgtg gcagttttgt tttgttttgc ttttttgcac ccagaggcaa 3541 aatgggtgga gcactatgcc caggggagcc cttcccgagg agtccagggg tgagcctctg 3601 tgatccccta atcaatctcc taggaatgga gggtagaccg agaaaaggct ggcatagggg 3661 gagtcagttt cccaggtaga agcaagaagg gtaccttttg ctcctcaccc tggatctctt 3721 ttccttccac ccaggtttct gcaggtaatg gtggcagcag cctctcttac acaaacccag 3781 cagtggcagc cacttctgcc aacttgtagg ggcacgtcgc ccgctgagct gagtggccag 3841 ccagtgccat tccactccac tcaggttctt cagggccaga gcccctgcac cctgtttggg 3901 ctggtgagct gggagttcag gtgggctgct cacacgtcct tcagaggccc caccaatttc 3961 tcggacactt ctcagtgtgt ggaagctcat gtgggcccct gaggctcatg cctgggaagt 4021 gttgtggtgg gggctcccag gaggactggc ccagagagcc ctgagatagc ggggatcctg 4081 aactggactg aataaaacgt ggtctcccac tggcgccaac ttctgatctt tcatctgtga 4141 cccgtgggca gcagggcgtc agaatgtgtg tgagggggct gggggaggag acagggaggc 4201 caggaggcag taaggagcga gtttgtttga gaagccaggg aga //
GenBank-Updates@genbank.bio.net (03/29/91)
LOCUS HUMPEM 4243 bp ds-DNA PRI 29-MAR-1991 DEFINITION Human polymorphic epithelial mucin (PEM) gene, complete cds. ACCESSION M61170 X54350 X54351 KEYWORDS polymorphic epithelial mucin. SOURCE Human blood lymphocyte cell and non-transformed helper T cell DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 4243) AUTHORS Lancaster,C.A., Peat,N., Duhig,T., Wilson,D., Taylor-Papadimitriou,J. and Gendler,S.J. TITLE Structure and expression of the human polymorphic epithelial mucin gene: An expressed VNTR unit JOURNAL Biochem. Biophys. Res. Commun. 173, 1019-1029 (1990) STANDARD simple staff_entry FEATURES Location/Qualifiers mRNA join(804..933,1436..2265,2365..2420,2570..2706,2851..2972, 3053..3202,3735..4112) /gene="PEM" CDS join(876..933,1436..2265,2365..2420,2570..2706,2851..2972, 3053..3202,3735..3809) /product="polymorphic epithelial mucin" /gene="PEM" /codon_start=876 exon 804..933 /number=1 /gene="PEM" exon 1436..2265 /number=2 /gene="PEM" exon 2365..2420 /number=3 /gene="PEM" exon 2570..2706 /number=4 /gene="PEM" exon 2851..2972 /number=5 /gene="PEM" exon 3053..3202 /number=6 /gene="PEM" exon 3735..4112 /number=7 /gene="PEM" BASE COUNT 853 a 1233 c 1268 g 889 t ORIGIN 1 tactcctctc cgcccggtcc gagcggcccc tcagcttgcg cggcccagcc ccgcaaggct 61 cccggtgacc actagagggc gggaggagct cctggccagt ggtggagagt ggcaaggaag 121 gaccctaggg ttcatcggag cccaggttta ctcccttaag tggaaatttc ttcccccact 181 cctccttggc tttctccaag gagggaaccc aggctgctgg aaagtccggc tggggggggg 241 actgtgggtt caggggagaa cggggtgtgg aacgggacag ggagcggtta gaagggtggg 301 gctattccgg gaagtggtgg ggggagggag cccaaaacta gcacctagtc cactcattat 361 ccagccctct tatttctcgg ccgctctgct tcagtggacc cggggagggc ggggaagtgg 421 agtgggagac ctaggggtgg gcttcccgac cttgctgtac aggacctcga cctagctggc 481 tttgttcccc atccccacgt tagttgttgc cctgaggcta aaactagagc ccaggggccc 541 caagttccag actgcccctc ccccctcccc cggagccagg gagtggttgg tgaaaggggg 601 aggccagctg gagaacaaac gggtagtcag ggggttgagc gattagagcc cttgtaccct 661 acccaggaat ggttggggag gaggaggaag aggtaggagg taggggaggg ggcggggttt 721 tgtcacctgt cacctgctcg ctgtgcctag ggcgggcggg cggggagtgg ggggaccggt 781 ataaagcggt aggcgcctgt gcccgctcca cctctcaagc agccagcgcc tgcctgaatc 841 tgttctgccc cctccccacc catttcacca ccaccatgac accgggcacc cagtctcctt 901 tcttcctgct gctgctcctc acagtgctta caggtgaggg gcacgaggtg gggagtgggc 961 tgccctgctt aggtggtctt cgtggtcttt ctgtgggttt tgctccctgg cagatggcac 1021 catgaagtta aggtaagaat tgcagacaga ggctgccctg tctgtgccag aaggagggag 1081 aggctaagga caggctgaga agagttgccc ccaaccctga gagtgggtac caggggcaag 1141 caaatgtcct gtagagaagt ctagggggaa gagagtaggg agagggaagg cttaagaggg 1201 gaagaaatgc aggggccatg agccaaggcc tatgggcaga gagaaggagg ctgctgcagg 1261 gaaggaggct tccaacccag gggttactga ggctgcccac tccccagtcc tcctggtatt 1321 atttctctgg tggccagagc ttatattttc ttcttgctct tatttttcct tcataaagac 1381 ccaaccctat gactttaact tcttacagct accacagccc ctaaacccgc aacagttgtt 1441 acaggttctg gtcatgcaag ctctacccca ggtggagaaa aggagacttc ggctacccag 1501 agaagttcag tgcccagctc tactgagaag aatgctgtga gtatgaccag cagcgtactc 1561 tccagccaca gccccggttc aggctcctcc accactcagg gacaggatgt cactctggcc 1621 ccggccacgg aaccagcttc aggttcagct gccacctggg gacaggatgt cacctcggtc 1681 ccagtcacca ggccagccct gggctccacc accccgccag cccacgatgt cacctcagcc 1741 ccggacaaca agccagcccc gggctccacc gcccccccag cccacggtgt cacctcggcc 1801 ccggacacca ggccggcccc gggctccacc gcccccccag cccatggtgt cacctcggcc 1861 ccggacaaca ggcccgcctt gggctccacc gcccctccag tccacaatgt cacctcggcc 1921 tcaggctctg catcaggctc agcttctact ctggtgcaca acggcacctc tgccagggct 1981 accacaaccc cagccagcaa gagcactcca ttctcaattc ccagccacca ctctgatact 2041 cctaccaccc ttgccagcca tagcaccaag actgatgcca gtagcactca ccatagcacg 2101 gtacctcctc tcacctcctc caatcacagc acttctcccc agttgtctac tggggtctct 2161 ttctttttcc tgtcttttca catttcaaac ctccagttta attcctctct ggaagatccc 2221 agcaccgact actaccaaga gctgcagaga gacatttctg aaatggtgag tatcggcctt 2281 tccttcccca tgctcccctg aagcagccat cagaactgtc cacacccttt gcatcaagcc 2341 cgagtccttt ccctctcacc ccagtttttg cagatttata aacaaggggg ttttctgggc 2401 ctctccaata ttaagttcag gtacagttct gggtgtggac ccagtgtggt ggttggaggg 2461 ttgggtggtg gtcatgaccg taggagggac tggtgcactt aaggttgggg gaagagtgct 2521 gagccagagc tgggacccgt ggctgaagtg cccatttccc tgtgaccagg ccaggatctg 2581 tggtggtaca attgactctg gccttccgag aaggtaccat caatgtccac gacgtggaga 2641 cacagttcaa tcagtataaa acggaagcag cctctcgata taacctgacg atctcagacg 2701 tcagcggtga ggctacttcc ctggctgcag ccagcaccat gccggggccc ctctccttcc 2761 agtgtctggg tccccgctct ttccttagtg ctggcagcgg gaggggcgcc tcctctggga 2821 gactgccctg accactgctt ttccttttag tgagtgatgt gccatttcct ttctctgccc 2881 agtctggggc tggggtgcca ggctggggca tcgcgctgct ggtgctggtc tgtgttctgg 2941 ttgcgctggc cattgtctat ctcattgcct tggtgagtgc agtccctggc cctgatcaga 3001 gccccccggt agaaggcact ccatggcctg ccataacctc ctatctcccc aggctgtctg 3061 tcagtgccgc cgaaagaact acgggcagct ggacatcttt ccagcccggg atacctacca 3121 tcctatgagc gagtacccca cctaccacac ccatgggcgc tatgtgcccc ctagcagtac 3181 cgatcgtagc ccctatgaga aggtgagatt ggccccacag gccaggggaa gcagagggtt 3241 tggctgggca aggattctga agggggtact tggaaaaccc aaagagcttg gaagaggtga 3301 gaagtggcgt gaagtgagca ggggagggcc tggcaaggat gaggggcaga ggtcagagga 3361 gttttggggg acaggcctgg gaggagacta tggaagaaag gggcctcaag agggagtggc 3421 cccactgcca gaattcctaa aaagatcatt ggccgtccac attcatgctg gctggcgctg 3481 gctgaactgg tgccaccgtg gcagttttgt tttgttttgc ttttttgcac ccagaggcaa 3541 aatgggtgga gcactatgcc caggggagcc cttcccgagg agtccagggg tgagcctctg 3601 tgatccccta atcaatctcc taggaatgga gggtagaccg agaaaaggct ggcatagggg 3661 gagtcagttt cccaggtaga agcaagaagg gtaccttttg ctcctcaccc tggatctctt 3721 ttccttccac ccaggtttct gcaggtaatg gtggcagcag cctctcttac acaaacccag 3781 cagtggcagc cacttctgcc aacttgtagg ggcacgtcgc ccgctgagct gagtggccag 3841 ccagtgccat tccactccac tcaggttctt cagggccaga gcccctgcac cctgtttggg 3901 ctggtgagct gggagttcag gtgggctgct cacacgtcct tcagaggccc caccaatttc 3961 tcggacactt ctcagtgtgt ggaagctcat gtgggcccct gaggctcatg cctgggaagt 4021 gttgtggtgg gggctcccag gaggactggc ccagagagcc ctgagatagc ggggatcctg 4081 aactggactg aataaaacgt ggtctcccac tggcgccaac ttctgatctt tcatctgtga 4141 cccgtgggca gcagggcgtc agaatgtgtg tgagggggct gggggaggag acagggaggc 4201 caggaggcag taaggagcga gtttgtttga gaagccaggg aga //