GenBank-Updates@genbank.bio.net (03/14/91)
LOCUS       HUMPEM       4243 bp ds-DNA             PRI       14-MAR-1991
DEFINITION  Human polymorphic epithelial mucin (PEM) gene, complete cds.
ACCESSION   M61170
KEYWORDS    polymorphic epithelial mucin.
SOURCE      Human blood lymphocyte cell and non-transformed helper T cell DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 4243)
  AUTHORS   Lancaster,C.A., Peat,N., Duhig,T., Wilson,D.,
            Taylor-Papadimitriou,J. and Gendler,S.J.
  TITLE     Structure and expression of the human polymorphic epithelial mucin
            gene: An expressed VNTR unit
  JOURNAL   Biochem. Biophys. Res. Commun. 173, 1019-1029 (1990)
  STANDARD  simple staff_entry
FEATURES             Location/Qualifiers
     mRNA            join(804..933,1436..2265,2365..2420,2570..2706,2851..2972,
                     3053..3202,3735..4112)
                     /gene="PEM"
     CDS             join(876..933,1436..2265,2365..2420,2570..2706,2851..2972,
                     3053..3202,3735..3809)
                     /product="polymorphic epithelial mucin"
                     /gene="PEM"
                     /codon_start=876
     exon            804..933
                     /number=1
                     /gene="PEM"
     exon            1436..2265
                     /number=2
                     /gene="PEM"
     exon            2365..2420
                     /number=3
                     /gene="PEM"
     exon            2570..2706
                     /number=4
                     /gene="PEM"
     exon            2851..2972
                     /number=5
                     /gene="PEM"
     exon            3053..3202
                     /number=6
                     /gene="PEM"
     exon            3735..4112
                     /number=7
                     /gene="PEM"
BASE COUNT      853 a   1233 c   1268 g    889 t
ORIGIN
1 tactcctctc cgcccggtcc gagcggcccc tcagcttgcg cggcccagcc ccgcaaggct
61 cccggtgacc actagagggc gggaggagct cctggccagt ggtggagagt ggcaaggaag
121 gaccctaggg ttcatcggag cccaggttta ctcccttaag tggaaatttc ttcccccact
181 cctccttggc tttctccaag gagggaaccc aggctgctgg aaagtccggc tggggggggg
241 actgtgggtt caggggagaa cggggtgtgg aacgggacag ggagcggtta gaagggtggg
301 gctattccgg gaagtggtgg ggggagggag cccaaaacta gcacctagtc cactcattat
361 ccagccctct tatttctcgg ccgctctgct tcagtggacc cggggagggc ggggaagtgg
421 agtgggagac ctaggggtgg gcttcccgac cttgctgtac aggacctcga cctagctggc
481 tttgttcccc atccccacgt tagttgttgc cctgaggcta aaactagagc ccaggggccc
541 caagttccag actgcccctc ccccctcccc cggagccagg gagtggttgg tgaaaggggg
601 aggccagctg gagaacaaac gggtagtcag ggggttgagc gattagagcc cttgtaccct
661 acccaggaat ggttggggag gaggaggaag aggtaggagg taggggaggg ggcggggttt
721 tgtcacctgt cacctgctcg ctgtgcctag ggcgggcggg cggggagtgg ggggaccggt
781 ataaagcggt aggcgcctgt gcccgctcca cctctcaagc agccagcgcc tgcctgaatc
841 tgttctgccc cctccccacc catttcacca ccaccatgac accgggcacc cagtctcctt
901 tcttcctgct gctgctcctc acagtgctta caggtgaggg gcacgaggtg gggagtgggc
961 tgccctgctt aggtggtctt cgtggtcttt ctgtgggttt tgctccctgg cagatggcac
1021 catgaagtta aggtaagaat tgcagacaga ggctgccctg tctgtgccag aaggagggag
1081 aggctaagga caggctgaga agagttgccc ccaaccctga gagtgggtac caggggcaag
1141 caaatgtcct gtagagaagt ctagggggaa gagagtaggg agagggaagg cttaagaggg
1201 gaagaaatgc aggggccatg agccaaggcc tatgggcaga gagaaggagg ctgctgcagg
1261 gaaggaggct tccaacccag gggttactga ggctgcccac tccccagtcc tcctggtatt
1321 atttctctgg tggccagagc ttatattttc ttcttgctct tatttttcct tcataaagac
1381 ccaaccctat gactttaact tcttacagct accacagccc ctaaacccgc aacagttgtt
1441 acaggttctg gtcatgcaag ctctacccca ggtggagaaa aggagacttc ggctacccag
1501 agaagttcag tgcccagctc tactgagaag aatgctgtga gtatgaccag cagcgtactc
1561 tccagccaca gccccggttc aggctcctcc accactcagg gacaggatgt cactctggcc
1621 ccggccacgg aaccagcttc aggttcagct gccacctggg gacaggatgt cacctcggtc
1681 ccagtcacca ggccagccct gggctccacc accccgccag cccacgatgt cacctcagcc
1741 ccggacaaca agccagcccc gggctccacc gcccccccag cccacggtgt cacctcggcc
1801 ccggacacca ggccggcccc gggctccacc gcccccccag cccatggtgt cacctcggcc
1861 ccggacaaca ggcccgcctt gggctccacc gcccctccag tccacaatgt cacctcggcc
1921 tcaggctctg catcaggctc agcttctact ctggtgcaca acggcacctc tgccagggct
1981 accacaaccc cagccagcaa gagcactcca ttctcaattc ccagccacca ctctgatact
2041 cctaccaccc ttgccagcca tagcaccaag actgatgcca gtagcactca ccatagcacg
2101 gtacctcctc tcacctcctc caatcacagc acttctcccc agttgtctac tggggtctct
2161 ttctttttcc tgtcttttca catttcaaac ctccagttta attcctctct ggaagatccc
2221 agcaccgact actaccaaga gctgcagaga gacatttctg aaatggtgag tatcggcctt
2281 tccttcccca tgctcccctg aagcagccat cagaactgtc cacacccttt gcatcaagcc
2341 cgagtccttt ccctctcacc ccagtttttg cagatttata aacaaggggg ttttctgggc
2401 ctctccaata ttaagttcag gtacagttct gggtgtggac ccagtgtggt ggttggaggg
2461 ttgggtggtg gtcatgaccg taggagggac tggtgcactt aaggttgggg gaagagtgct
2521 gagccagagc tgggacccgt ggctgaagtg cccatttccc tgtgaccagg ccaggatctg
2581 tggtggtaca attgactctg gccttccgag aaggtaccat caatgtccac gacgtggaga
2641 cacagttcaa tcagtataaa acggaagcag cctctcgata taacctgacg atctcagacg
2701 tcagcggtga ggctacttcc ctggctgcag ccagcaccat gccggggccc ctctccttcc
2761 agtgtctggg tccccgctct ttccttagtg ctggcagcgg gaggggcgcc tcctctggga
2821 gactgccctg accactgctt ttccttttag tgagtgatgt gccatttcct ttctctgccc
2881 agtctggggc tggggtgcca ggctggggca tcgcgctgct ggtgctggtc tgtgttctgg
2941 ttgcgctggc cattgtctat ctcattgcct tggtgagtgc agtccctggc cctgatcaga
3001 gccccccggt agaaggcact ccatggcctg ccataacctc ctatctcccc aggctgtctg
3061 tcagtgccgc cgaaagaact acgggcagct ggacatcttt ccagcccggg atacctacca
3121 tcctatgagc gagtacccca cctaccacac ccatgggcgc tatgtgcccc ctagcagtac
3181 cgatcgtagc ccctatgaga aggtgagatt ggccccacag gccaggggaa gcagagggtt
3241 tggctgggca aggattctga agggggtact tggaaaaccc aaagagcttg gaagaggtga
3301 gaagtggcgt gaagtgagca ggggagggcc tggcaaggat gaggggcaga ggtcagagga
3361 gttttggggg acaggcctgg gaggagacta tggaagaaag gggcctcaag agggagtggc
3421 cccactgcca gaattcctaa aaagatcatt ggccgtccac attcatgctg gctggcgctg
3481 gctgaactgg tgccaccgtg gcagttttgt tttgttttgc ttttttgcac ccagaggcaa
3541 aatgggtgga gcactatgcc caggggagcc cttcccgagg agtccagggg tgagcctctg
3601 tgatccccta atcaatctcc taggaatgga gggtagaccg agaaaaggct ggcatagggg
3661 gagtcagttt cccaggtaga agcaagaagg gtaccttttg ctcctcaccc tggatctctt
3721 ttccttccac ccaggtttct gcaggtaatg gtggcagcag cctctcttac acaaacccag
3781 cagtggcagc cacttctgcc aacttgtagg ggcacgtcgc ccgctgagct gagtggccag
3841 ccagtgccat tccactccac tcaggttctt cagggccaga gcccctgcac cctgtttggg
3901 ctggtgagct gggagttcag gtgggctgct cacacgtcct tcagaggccc caccaatttc
3961 tcggacactt ctcagtgtgt ggaagctcat gtgggcccct gaggctcatg cctgggaagt
4021 gttgtggtgg gggctcccag gaggactggc ccagagagcc ctgagatagc ggggatcctg
4081 aactggactg aataaaacgt ggtctcccac tggcgccaac ttctgatctt tcatctgtga
4141 cccgtgggca gcagggcgtc agaatgtgtg tgagggggct gggggaggag acagggaggc
4201 caggaggcag taaggagcga gtttgtttga gaagccaggg aga
//
GenBank-Updates@genbank.bio.net (03/29/91)
LOCUS       HUMPEM       4243 bp ds-DNA             PRI       29-MAR-1991
DEFINITION  Human polymorphic epithelial mucin (PEM) gene, complete cds.
ACCESSION   M61170 X54350 X54351
KEYWORDS    polymorphic epithelial mucin.
SOURCE      Human blood lymphocyte cell and non-transformed helper T cell DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 4243)
  AUTHORS   Lancaster,C.A., Peat,N., Duhig,T., Wilson,D.,
            Taylor-Papadimitriou,J. and Gendler,S.J.
  TITLE     Structure and expression of the human polymorphic epithelial mucin
            gene: An expressed VNTR unit
  JOURNAL   Biochem. Biophys. Res. Commun. 173, 1019-1029 (1990)
  STANDARD  simple staff_entry
FEATURES             Location/Qualifiers
     mRNA            join(804..933,1436..2265,2365..2420,2570..2706,2851..2972,
                     3053..3202,3735..4112)
                     /gene="PEM"
     CDS             join(876..933,1436..2265,2365..2420,2570..2706,2851..2972,
                     3053..3202,3735..3809)
                     /product="polymorphic epithelial mucin"
                     /gene="PEM"
                     /codon_start=876
     exon            804..933
                     /number=1
                     /gene="PEM"
     exon            1436..2265
                     /number=2
                     /gene="PEM"
     exon            2365..2420
                     /number=3
                     /gene="PEM"
     exon            2570..2706
                     /number=4
                     /gene="PEM"
     exon            2851..2972
                     /number=5
                     /gene="PEM"
     exon            3053..3202
                     /number=6
                     /gene="PEM"
     exon            3735..4112
                     /number=7
                     /gene="PEM"
BASE COUNT      853 a   1233 c   1268 g    889 t
ORIGIN
1 tactcctctc cgcccggtcc gagcggcccc tcagcttgcg cggcccagcc ccgcaaggct
61 cccggtgacc actagagggc gggaggagct cctggccagt ggtggagagt ggcaaggaag
121 gaccctaggg ttcatcggag cccaggttta ctcccttaag tggaaatttc ttcccccact
181 cctccttggc tttctccaag gagggaaccc aggctgctgg aaagtccggc tggggggggg
241 actgtgggtt caggggagaa cggggtgtgg aacgggacag ggagcggtta gaagggtggg
301 gctattccgg gaagtggtgg ggggagggag cccaaaacta gcacctagtc cactcattat
361 ccagccctct tatttctcgg ccgctctgct tcagtggacc cggggagggc ggggaagtgg
421 agtgggagac ctaggggtgg gcttcccgac cttgctgtac aggacctcga cctagctggc
481 tttgttcccc atccccacgt tagttgttgc cctgaggcta aaactagagc ccaggggccc
541 caagttccag actgcccctc ccccctcccc cggagccagg gagtggttgg tgaaaggggg
601 aggccagctg gagaacaaac gggtagtcag ggggttgagc gattagagcc cttgtaccct
661 acccaggaat ggttggggag gaggaggaag aggtaggagg taggggaggg ggcggggttt
721 tgtcacctgt cacctgctcg ctgtgcctag ggcgggcggg cggggagtgg ggggaccggt
781 ataaagcggt aggcgcctgt gcccgctcca cctctcaagc agccagcgcc tgcctgaatc
841 tgttctgccc cctccccacc catttcacca ccaccatgac accgggcacc cagtctcctt
901 tcttcctgct gctgctcctc acagtgctta caggtgaggg gcacgaggtg gggagtgggc
961 tgccctgctt aggtggtctt cgtggtcttt ctgtgggttt tgctccctgg cagatggcac
1021 catgaagtta aggtaagaat tgcagacaga ggctgccctg tctgtgccag aaggagggag
1081 aggctaagga caggctgaga agagttgccc ccaaccctga gagtgggtac caggggcaag
1141 caaatgtcct gtagagaagt ctagggggaa gagagtaggg agagggaagg cttaagaggg
1201 gaagaaatgc aggggccatg agccaaggcc tatgggcaga gagaaggagg ctgctgcagg
1261 gaaggaggct tccaacccag gggttactga ggctgcccac tccccagtcc tcctggtatt
1321 atttctctgg tggccagagc ttatattttc ttcttgctct tatttttcct tcataaagac
1381 ccaaccctat gactttaact tcttacagct accacagccc ctaaacccgc aacagttgtt
1441 acaggttctg gtcatgcaag ctctacccca ggtggagaaa aggagacttc ggctacccag
1501 agaagttcag tgcccagctc tactgagaag aatgctgtga gtatgaccag cagcgtactc
1561 tccagccaca gccccggttc aggctcctcc accactcagg gacaggatgt cactctggcc
1621 ccggccacgg aaccagcttc aggttcagct gccacctggg gacaggatgt cacctcggtc
1681 ccagtcacca ggccagccct gggctccacc accccgccag cccacgatgt cacctcagcc
1741 ccggacaaca agccagcccc gggctccacc gcccccccag cccacggtgt cacctcggcc
1801 ccggacacca ggccggcccc gggctccacc gcccccccag cccatggtgt cacctcggcc
1861 ccggacaaca ggcccgcctt gggctccacc gcccctccag tccacaatgt cacctcggcc
1921 tcaggctctg catcaggctc agcttctact ctggtgcaca acggcacctc tgccagggct
1981 accacaaccc cagccagcaa gagcactcca ttctcaattc ccagccacca ctctgatact
2041 cctaccaccc ttgccagcca tagcaccaag actgatgcca gtagcactca ccatagcacg
2101 gtacctcctc tcacctcctc caatcacagc acttctcccc agttgtctac tggggtctct
2161 ttctttttcc tgtcttttca catttcaaac ctccagttta attcctctct ggaagatccc
2221 agcaccgact actaccaaga gctgcagaga gacatttctg aaatggtgag tatcggcctt
2281 tccttcccca tgctcccctg aagcagccat cagaactgtc cacacccttt gcatcaagcc
2341 cgagtccttt ccctctcacc ccagtttttg cagatttata aacaaggggg ttttctgggc
2401 ctctccaata ttaagttcag gtacagttct gggtgtggac ccagtgtggt ggttggaggg
2461 ttgggtggtg gtcatgaccg taggagggac tggtgcactt aaggttgggg gaagagtgct
2521 gagccagagc tgggacccgt ggctgaagtg cccatttccc tgtgaccagg ccaggatctg
2581 tggtggtaca attgactctg gccttccgag aaggtaccat caatgtccac gacgtggaga
2641 cacagttcaa tcagtataaa acggaagcag cctctcgata taacctgacg atctcagacg
2701 tcagcggtga ggctacttcc ctggctgcag ccagcaccat gccggggccc ctctccttcc
2761 agtgtctggg tccccgctct ttccttagtg ctggcagcgg gaggggcgcc tcctctggga
2821 gactgccctg accactgctt ttccttttag tgagtgatgt gccatttcct ttctctgccc
2881 agtctggggc tggggtgcca ggctggggca tcgcgctgct ggtgctggtc tgtgttctgg
2941 ttgcgctggc cattgtctat ctcattgcct tggtgagtgc agtccctggc cctgatcaga
3001 gccccccggt agaaggcact ccatggcctg ccataacctc ctatctcccc aggctgtctg
3061 tcagtgccgc cgaaagaact acgggcagct ggacatcttt ccagcccggg atacctacca
3121 tcctatgagc gagtacccca cctaccacac ccatgggcgc tatgtgcccc ctagcagtac
3181 cgatcgtagc ccctatgaga aggtgagatt ggccccacag gccaggggaa gcagagggtt
3241 tggctgggca aggattctga agggggtact tggaaaaccc aaagagcttg gaagaggtga
3301 gaagtggcgt gaagtgagca ggggagggcc tggcaaggat gaggggcaga ggtcagagga
3361 gttttggggg acaggcctgg gaggagacta tggaagaaag gggcctcaag agggagtggc
3421 cccactgcca gaattcctaa aaagatcatt ggccgtccac attcatgctg gctggcgctg
3481 gctgaactgg tgccaccgtg gcagttttgt tttgttttgc ttttttgcac ccagaggcaa
3541 aatgggtgga gcactatgcc caggggagcc cttcccgagg agtccagggg tgagcctctg
3601 tgatccccta atcaatctcc taggaatgga gggtagaccg agaaaaggct ggcatagggg
3661 gagtcagttt cccaggtaga agcaagaagg gtaccttttg ctcctcaccc tggatctctt
3721 ttccttccac ccaggtttct gcaggtaatg gtggcagcag cctctcttac acaaacccag
3781 cagtggcagc cacttctgcc aacttgtagg ggcacgtcgc ccgctgagct gagtggccag
3841 ccagtgccat tccactccac tcaggttctt cagggccaga gcccctgcac cctgtttggg
3901 ctggtgagct gggagttcag gtgggctgct cacacgtcct tcagaggccc caccaatttc
3961 tcggacactt ctcagtgtgt ggaagctcat gtgggcccct gaggctcatg cctgggaagt
4021 gttgtggtgg gggctcccag gaggactggc ccagagagcc ctgagatagc ggggatcctg
4081 aactggactg aataaaacgt ggtctcccac tggcgccaac ttctgatctt tcatctgtga
4141 cccgtgggca gcagggcgtc agaatgtgtg tgagggggct gggggaggag acagggaggc
4201 caggaggcag taaggagcga gtttgtttga gaagccaggg aga
//